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FOREWORD 


T HIS volume by Kenneth Lee Pike has grown out of some seven 
years of practical experience in the field attempting to analyze 
and describe the sounds of a number of non-Indo-European languages. 
For much of this time Dr. Pike lived among the Mixtecos in south- 
western Mexico, an Indian people numbering about 200,000. The 
language of these people he has mastered and is describing in several 
monographs. His experience, however, has not been limited to this 
particular language. As the phonetics specialist of the Wycliffe Bible 
Translators who have language investigators working in each of the 
important Indian languages of Mexico, and as the professor of pho- 
netics of the Summer Institute of Linguistics in which these and other 
such investigators are trained, he has assisted in analyzing and de- 
scribing the sounds of a wide variety of the languages of so-called 
primitive peoples. Phonetics, which is now followed by a second 1 
and a third volume on related subjects, first reviews critically the 
basic assumptions of phonetic theory and then proceeds to clarify and 
restate many of these assumptions. Upon this foundation the author 
constructs basic definitions and classifications of such items as phone, 
syllable, stop, vowel, and consonant, and provides a descriptive order 
in which all sounds may be described in relation to their productive 
mechanisms. 

The end sought is the description of any sound apart from speech 
in order that factors of speech may not give undue influence to the 
terminology applied to it. To accomplish this end it was necessary to 
examine the relationship between speech and nonspeech sounds and 
to establish a technic of description which could deal with all nonsense 
sounds and syllables as well as with those of language. In other 
words, it was necessary to divorce pure phonetics, the description of 
the nature and formation of sounds, from phonemics, the function of 
sounds in speech. This volume, therefore, does not deal with the 
speech function of sounds, nor with the practical methods for finding 
the phonemes of a language, nor even with the formation of phonetic 

1 The second volume, T one Languages (University of Michigan Press) , deals 
with the nature of tonal systems and provides a technic for the analysis of then- 
significant pitch contrasts. 
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orthographies. In fact, it is not a conventional “ practical phonetics ” 
book. 

The present volume is not “ impractical,” however, because it covers 
only one part of the field. On the contrary, the material here pre- 
sented is eminently practical in that it provides the data which places 
speech in a new perspective. It leads the investigator to see that 
sounds have a subphonemic segmentation which underlies all speech 
and actually limits the forms into which languages are approximately 
molded. In addition, it lays the groundwork for a different approach 
to phonemic units by way of segmental units — an approach which is 
already proving of value in experimental classes given to research 
workers in the phonemic field. 

This reworking of phonetic theory should prove of great value in 
another direction. It seems now to be fairly well recognized that an 
instrumental study of the acoustic reality of sounds or of the physical 
minutiae of their production has little practical bearing upon language 
problems until it has been correlated with the perceptual reactions of 
speakers to these sounds. These perceptual reactions must be clas- 
sified in some way. Often those who deny the necessity of any per- 
ceptual correlations really use perceptual classifications of an ex- 
tremely naive type. It is to be hoped that laboratory technicians will 
find the new classifications offered here of considerable assistance in 
providing the perceptual assumptions basic to the interpretation of 
the actual physical data which they uncover. 

The chief practical contribution of this volume, however, may be to 
those students of general phonetics who intend to reduce to writing 
the languages which are as yet without literature. To these students 
this contribution will not be the furnishing of specific technics for 
alphabet formation , 2 but the provision of the technical background 
for the analysis of sounds which is prerequisite to any such technics. 
Specific alphabet-forming technics are of little value to an investigator 
who cannot recognize and transcribe sounds — and features of sounds 
are most readily recognized when the total system of sound production 
is understood. The present volume provides this basic analysis of the 
formation of sounds. 

Charles C. Fries 

2 The third volume, Phonemics (University of Michigan Press), provides 
for this correlative need. 
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INTRODUCTION 

P RESENT phonetic classifications are based on a limited number 
of sounds. Many sounds actually occurring in speech, in addi- 
tion to many types which have not been found in language, are fre- 
quently ignored, while others are mentioned but briefly or, if discussed 
at all, grouped together into small unorganized sections where they 
fail to contribute their potential share to the establishment of adequate 
analyses. Different kinds of data have contributed to the analysis and 
classification of vocal sounds. Movements of the vocal organs have 
given the basis for physiological description, but this material has been 
modified by the acoustic nature of the sounds and by the function 
which they perform in speech systems. Many fundamental assump- 
tions which have never been adequately stated have become estab- 
lished in phonetic descriptions. These assumptions have tended to 
conceal the relationship between the various types of data, and a 
number of them have discouraged the investigation of certain sounds 
that would have called attention to many of the assumptions them- 
selves. 

Part I of the study here presented reviews phonetic theory in sev- 
eral of its fundamental phases in an attempt to weigh the factors in- 
volved in some of these problems. Chapter I shows the minor place 
given in phonetic literature to many types of sound that occur in 
speech, and discusses some of the reasons for the difference in treat- 
ment between these sounds and those which are accorded more promi- 
nence. The second half of the chapter describes briefly four possible 
methods for investigating sounds and indicates which one has been em- 
ployed for the analysis of the material underlying this study, with 
certain reasons for its choice. Chapter II points out additional Sound 
types which current phonetic classifications tend to ignore and calls 
attention to other undefined assumptions underlying this neglect, but 
here the sounds discussed are one step further removed from speech. 
The boundary line between the sounds considered in the first chapter 
as contrasted with those discussed in the second is somewhat vague, 
chosen perhaps as much for convenience as on the basis of structure. 
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Chapter III shows some of the problems involved in cutting a con- 
tinuous stream of speech into segments. It postulates the existence 
of a nonfictitious phonetic segmentation which is not dependent upon 
phonemic divisions, but which to a large extent conditions those di- 
visions; the segment itself is not defined until Chapter VII. Chapter 
IV brings forward evidence that the cavities of the vocal mechanism do 
not have identical functions in producing sound, and that closures, or 
partial strictures, or movements likewise have varied places in the 
economy of sound production. Many of these differences are shown 
to comprise basic and essential assumptions in all articulatory descrip- 
tions of sounds which must be retained but which should be defined. 
Chapter V traces the mixture of articulatory, acoustic, and phonemic 
criteria in consonant-vowel differentiation. It suggests the elimina- 
tion of certain of the criteria and the expansion of others. Part I 
analyzes various key difficulties in phonetic description. After these 
factors have been reviewed, Part II presents a constructive system 
which attempts to embody the suggestions contained in this analysis. 



CHAPTER I 


MARGINAL SOUNDS 

S OUNDS occurring infrequently as phonemic norms, or appearing 
but rarely in certain set types of speech, comprise a group of mar- 
ginal speech sounds. Data which they can contribute to a phonetic 
analysis tend to be neglected or given too little weight in the estab- 
lishment of basic phonetic classifications. The explanation for the 
shunting of marginal sounds to one side seems to be the assumption 
of standards of normality for the selection of those speech sounds 
which are allowed to contribute criteria for the establishment of basic 
classes. 

Many investigators 1 in defining vowels include as one of the es- 
sential characteristics that all vowels should be voiced, rejecting other 
varieties as abnormal. Jones, for example, says that “ whispered 
speech is not regarded as normal.” 2 Westermann and Ward use al- 
most identically the same words . 3 Noel-Armfield, as if it were not 
sufficient to exclude whisper from vowel definition , 4 discusses it in 
a small section 5 together with inverse sounds, clicks, and consonants 
with glottal closure — types entirely extraneous. Passy does like- 
wise, both in excluding it from his definition of vowel , 6 and in rele- 
gating it to a small section 7 containing rather rare speech sounds 

1 Gairdner, The Phonetics of Arabic, n; Jones, An Outline of English Pho- 
netics 4 , §97, see also §255 and p. 23 n. ; Kenyon, American Pronunciation 6 , 
57-58; Nicholson, A Practical Introduction to French Phonetics, 9; Noel-Arm- 
field, General Phonetics 4 , 8, 31; Passy, Petite phonetique comparee 3 , 14; idem, 
The Sounds of the French Language 2 , 56; Ripman, Elements of Phonetics, 10; 
Stirling, The Pronunciation of Spanish, 6; Sweet, A Primer of Phonetics 3 , 13; 
Vietor, German Pronunciation 5 , 7; see also Fletcher, Speech and Hearing, 5, 7. 
For further discussion of vowel definition see Chapter V. 

2 Jones, 23 n. 

3 Westermann and Ward, Practical Phonetics for Students of African Lan- 
guages, 13. 

4 Noel-Armfield, 8 n. 

6 Noel-Armfield, 120-23. 

6 Passy, Sounds, 56; Phonttique, 14. 

7 Passy, Sounds, 86-88; Phonitique, 1 10-13. 

S 
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(clicks 8 ), rare variants of speech sounds (inverse 9 ), and nonspeech 
sounds (whistle 10 ). These, and elsewhere glides 11 also, he calls “ ac- 
cessories,” 12 giving ample evidence that he considers them abnormal. 
Even more surprising as “ accessories of language ” are expression of 
the face, gesture, and the language of signs . 13 This classification is 
made in spite of the fact that he lists whisper as occurring regularly 
in some languages . 14 

Although whisper is thus given an unsatisfactory classification, it 
is discussed or at least mentioned in many places . 15 Related problems 
of breathed or voiceless vowels , 16 and “ voiced [h] ,” 17 receive less 
attention. 

Conditions of the vocal cords other than those associated with 
voice, whisper, and breath are usually completely ignored. Singing 
is used for an occasional illustration . 18 Vibrato is not considered . 19 
Falsetto is rarely mentioned . 20 Types of vocal trill or murmur are 
not clearly differentiated, but a few receive passing notice . 21 

The second major criterion of normal usage often applied is that 

8 Passy, Sounds, 86-87; Phonitique, 113. 

9 Passy, Sounds, 110-12; Phonetique, 86. 

10 Passy, Sounds, 88; Phonetique, 113. 

11 Passy, Sounds, 89. 

12 Passy, Sounds, 86; Phonetique, no. 

13 Passy, Sounds, 89. 

1 4 Ibid., 87. 

15 Bloomfield, Language, 95, 102; Curry, The Mechanism of the Human 
Voice, 30-31; Gairdner, 11, 32; Jones, 23 n., §§ 82, 89; Kenyon, 38, 49; idem in 
Webster’s New International Dictionary of the English Language 2 , §§ 16, 44 
(10); Noel-Armfield, 8n, 121-22; Passy, Sounds, 15, 56, 87; Ripman, 7, 21; 
Sweet, 10, 12, 22-23, 28 ; Westermann and Ward, 13, 85. 

See also C. B. Miller, An Experimental-phonetic Investigation of Whispered 
Conversation; D. C. Miller, The Science of Musical Sounds, 23s; Rousselot, Prin- 
cipes de phonetique experimental (New Ed.), 468-78. 

16 Jespersen, The Articulations of Speech Sounds Represented by Means of 
Analphabetic Symbols, 70-71; Kenyon, 47, 139; idem in Webster, §44 (10); 
Noel-Armfield, 31, n. 1; Sweet, 22. 

17 Jones, §779; Kenyon, 47; idem in Webster, §44 (10); Noel-Armfield, 
30; Ripman, 21. 

18 Jones, 70 n. ; Passy, Sounds, 55 ; Ripman, 13; Sweet, 53-54, 66-69. 

19 But see The Vibrato, edited by Seashore ( Univ . Iowa Studies, Studies in 
the Psychology of Music, Vol. I). 

20 Curry, 46-47; Sweet, 10; Rousselot, 252 (and 479-82 for ventriloquism) ; 
Duyff “ Petite contribution a la connaissance de la voix de fausset,” Arch. Neer. 
Phon. Expir., 4 (1929), 67-71 [with bibliography], 

21 Bloomfield, 95, 99, 101, 102, 112; Jespersen, 32, 75. 
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a sound should be made with air leaving the lungs. Passy says, “ All 
speech sounds have their origin in a single physiological act — respira- 
tion — modified in various ways.” 22 Later in the same work 23 he 
distinguishes between clicks, which have no air going below the glottis, 
and inverse sounds. The contradiction is explainable only on the 
ground that in the first instance Passy included sounds which he 
considered normal, and hence that clicks are not normal and therefore 
excluded. The abnormality of clicks in his opinion is further evidenced 
by their being given but slight space in an unclassified catch-all section 
of “ accessory sounds.” 24 

Ripman states that a current of air from the lungs “ is the essential 
element of all speech ,” 25 without discussing click types; he does men- 
tion “ inhaled [inverse] breath.” 26 

Kenyon makes no qualification of his statement, “ The sounds of 
speech are produced by breath forced from the lungs and modified by 
the vocal organs,” 27 except by implication when he says of stops, 
“ Outward breath-pressure is assumed in all cases.” 28 The fact that 
he is dealing with just one language will not completely explain his 
assumption when other writers 29 show various types of inverse and 
click sounds as interjections in English . 30 Elsewhere he mentions in- 
terjectional sounds not in the English phonemic system 31 and, also, 
even foreign sounds . 32 

These quotations reflect the normative attitude of other writers, 
but are more strongly phrased. Noel-Armfield’s chapter which dis- 
cusses clicks and inverse sounds has , 33 like Passy ’s, no organization 
except as a container of what he thus implies are abnormalities; in it 
the presence of whispered sounds which he had previously excluded 
from his basic phonetic classification 34 suggests that these other 

22 Passy, Sounds, 6; “ General Discussions of Respiration and Its Modifica- 
tion,” 6-g. 

23 Passy, Sounds, 87. 25 Ripman, 1. 27 Kenyon, 38. 

24 See n. 12. 26 Ripman, 2. . 28 Kenyon, 40. 

29 Bloomfield, 94; Noel-Armfield, 120-21; Sweet, 43. 

30 I personally know a speaker of American English who now and again 
uses glottalized stops as variants of [t] and implosive stops as variants of [b]. 
Jones, § 571, mentions ejectives occasionally employed by French people. 

31 Kenyon, 47-48: [mmm; mm?m], 

32 Kenyon, 48: [13 . 

33 Noel-Armfield, 121-23. 

84 Noel-Armfield, 8 n., 31 n. 
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sounds are also excluded. As late as 1912 the alphabet of the Inter- 
national Phonetic Association included merely symbols for sounds 
made by air from the lungs only. 35 

Various writers discuss or refer to types of stops made without 
air from the lungs: glottalized stops (ejectives), 36 implosives (glot- 
talic clicks), 37 clicks (velaric clicks), 38 and egressive clicks (reverse 
clicks) . 39 When sounds made with air entering the lungs receive men- 
tion, 40 they are treated perhaps with slightly less feeling of abnor- 
mality. 

Sounds produced or modified by larynx, pharynx, false vocal cords, 
and epiglottis are often considered abnormal in spite of the physiologi- 
cal normality of their processes. These sounds are paid but scant at- 
tention 41 in description of the vocal apparatus. They seem little 
understood, since they are difficult to produce and, furthermore, have 
but slight place in phonemic systems. 

Certain sonorants, [ 1 ], [r], [m], [n], are considered perfectly 

35 The Principles of the International Phonetic Association (Supplement to 
Le Maitre Phonetique), 10. 

36 Bell, Visible Speech (Inaugural Ed.), 126; Bloomfield, 99; Jones, § 570 ; 
Noel-Armfield, 3, 122; Sweet, 59; Westermann and Ward, 96. 

37 Bloomfield, 93-94 (suction sounds) ; Passy, Sounds, 87 ; Sweet, 43 ; Ward, 
An Introduction to the Ibo Language, 6; Westermann and Ward, 92. 

38 Bell, 56, 126; Bloomfield, 93-94; Jespersen, Lehrbuch der Phonetik 4 , 124; 
Noel-Armfield, 120-21; Passy, Sounds, 87; Sweet, 7; Westermann and Ward, 98. 

38 Bell, 62. 

40 Noel-Armfield, 120; Passy, Sounds, 86; Ripman, 2 ; Sweet, 43. 

41 For mention of larynx or pharynx see Curry, 37, 74, 15S-56; Jones, 70, 
n. is; Muckey, The Natural Method of Voice Production, 18, 85, 14S ; Noel- 
Armfield, 4-5, 107, 109-12; Passy, Sounds, 20-21; Ripman, 9; Ward, “ Phonetic 
Phenomena in African Languages,” Archiv fur vergleichende Phonetik, 1 (1937), 
51 ; Westermann and Ward, 85. Compare sounds of “ bronchial tubes ” mentioned 
in Passy, Sounds, 84; Sweet, 12. See also Carmody, “An X-Ray Study of 
Pharyngeal Articulation,” Univ. Calif. Publ. Mod. Philol., 21, No. 5 (1941), 
377 - 84 - 

For mention of false vocal cords see Jespersen, Articulations, 14; Jones, § 82 ; 
Muckey, 70-73; Ripman, 5; Russell, Speech and Voice, 232, 236-37; Sweet, 8. 

For mention of epiglottis see Doke, “ The Phonetics of the Zulu Language ” 
( Bantu Studies, Vol. 2, Special Number), 33; Negus, The Mechanism of the 
Larynx, 39-44, 229, 449 ; Noel-Armfield, 4-5 ; Passy, Sounds, 12 ; Russell, 210-14 ; 
idem, The Vowel, 117-26. 

For mention of lungs see Russell, Speech and Voice, 215-17. 

For discussion of the action of the laryngeal ventricle see Pepinsky, “ The 
Laryngeal Ventricle Considered as an Acoustical Filter,” Journ. Acoust. Soc. 
Am., 14 (1942), 32-35. 
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normal when voiced ; the voiceless relationship 42 provides a problem 
similar to that of the vowels (see “ Classification Criteria,” Chap- 
ter V). 

Back unrounded and front rounded vowels are frequently treated 
as abnormal, 43 in contrast to front unrounded and back rounded 
vowels. 

Many writers, 44 especially those dealing with single languages, 
make no mention of any of the marginal sounds discussed above ex- 
cept for items from the last group when they are phonemic in the 
language studied. In general, these writers are the same as those 
who make no mention of nonspeech sounds. 45 The advantage which 
such writers, and especially teachers of phonetics of single languages, 
may gain by the utilization of marginal and nonspeech sounds will 
be discussed later (p. 24). 

Within the description of sounds as a whole, much is to be desired. 
The assumption that only egressive air from the lungs is normal has 
shut off many comparative data to be derived from clicks, glottalized 
stops, and the like, data which would have led to the description of 
more factors concerning air streams and their chambers and passage- 
ways. No discussion shows that both compression and rarefaction (in 
turn) may be applied to any sound type; no attempt is made, for ex- 
ample, to find vowels with a click mechanism or trills with pressure 
from the larynx. 

The extent of an air chamber seems not to be defined ; the differ- 
ence between the function of the mouth chamber during a nasal con- 
sonant and its function in oral vowel production is not fitted into a 
system of air chambers. The varying usage of closures and partial 
strictures for limiting such chambers as distinct from causing air 
pressure or rarefaction by their movement, or causing friction by their 
impeding of an air stream remains unclassified. See “ Strictural Func- 
tion ” (pp. 56-65, 129-36) for fuller discussion of the functional 

42 I have found no clear presentation of the problem, but cf. Jones, 24, n. 3; 
Kenyon, 47-48, 60, 127, 137; Noel-Armfield, 26-27, 82 > 108. 

43 Passy, Sounds, 58-60 ; Phonttique, 94-95. For secondary cardinal vowels 
see Jones, 35; Noel-Armfield, 16. 

44 Forchhammer, How to Learn Danish*; Jones and Woo, A Cantonese 
Phonetic Reader; Karlgren, A Mandarin Phonetic Reader ( Archives d’ Etudes 
Orientates, Vol. 13) ; Nicholson; Vietor. 

45 See p. 34, n. 25. 
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differences. Presumably failure to perceive the relationship of the 
glottal stop to its pressure chamber is what caused Jones to say that 
it is “ neither breathed nor voiced ” 46 (in contrast to other stops 
which may be either). 

Although in regard to glottalized stops various investigators 47 
have mentioned the rising action of the larynx to achieve the compres- 
sion required, and others, less felicitously, have mentioned in that 
connection, or in relation to clicks, an “ inner closure,” 48 yet I find 
no evidence of anyone's hunting for a stricture having the same 
function but lying in reverse position (i.e. outward in respect to the 
releasing stricture). Such phenomena can be found (see pp. 88, 
101-2); they are more important for forcing classification of types 
of air chambers and functions of strictures, the abandonment of 
casual assumptions, than for their own frequency of occurrence. 

The relegation of nonpulmonic sounds to an obscure place or their 
entire omission has brought with it the obscuring of basic sound 
production. No one, especially not a beginning student of phonetics, 
can expect to grasp the vocal panorama of the possibilities of com- 
pression and rarefaction when only one compressive type is given. 

The middle ground of percussion (pp. 103-5) apart from com- 
pression and rarefaction seems to be relatively unexplored, but serves 
to emphasize the classification by delineating its border lines and 
explaining a few puzzling transitional sounds, or the sounds resulting 
from the opening and closing of the lips while the remainder of the 
vocal mechanism is motionless (but while a closure is maintained at 
the glottis or the back part of the tongue; the same effect may be 
obtained by snapping the outside of the throat or cheek while the 
glottis is closed) . The quiescent air chambers for these sounds must 
also be classified. 

To obscure further the possibility of recognition of a full gamut 
of stops, fricatives, nasals, and so on produced by each of the vari- 
ous pressure-rarefaction mechanisms, stops are the only types of non- 
pulmonic mechanisms mentioned iri the textbooks; fricatives and 
nasals find no place whatsoever. Noel-Armfield’s chart, “ The Main 

46 Jones, § SS 3 - 

47 Jones, § 570; Sweet, 59. 

48 Bell, 62 ; cf. also 56 (“ suction stopper ”) ; Bloomfield, 99 ; Westermann 
and Ward, 96. 
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Types of Human Speech Sounds,” 49 demonstrates the point; three 
clicks are the only nonpulmonic sounds recorded, and these are listed 
on the chart in the same place as the plosives (i.e. stops). 

Certain types of voiced “ glottalized continuants ” 50 disguise this 
lack of nonpulmonic fricatives; actually such continuants are pro- 
duced by air from the lungs, and not at all by the mechanism of rising 
larynx which produces glottalized stops (but sometimes the two types 
may be parallel phonemically if not phonetically) . 

I find no adequate description of the combination, or possibilities 
of combination, of pulmonic types with nonpulmonic types. Nasal- 
ized clicks are well enough known, 51 but if terminology has not been 
built up which is adequate for describing the chambers and strictural 
functions of the separate types, still less has it been provided for the 
combinations. 

Because present classifications eliminate much marginal material 
from basic consideration they tend to lead to a feeling of false security, 
so that students, thinking them complete, may fail to look for more 
sound types, or may have their perception dulled to the implica- 
tions of new items which they might discover. This limitation of 
sound types in classifications has prevented the development of a 
terminology which would describe many items of sound that almost 
certainly exist in language, although at present they are unreported. 
The failure to develop classifications and terminology for nonspeech 
sounds in an inclusive system with speech sounds and marginal sounds 
militates in the same direction (see Chapter II for discussion of non- 
speech factors) . 

Normative prejudice causes in part the selection of classification 
criteria which cannot be strictly applied, and leaves contradictions in 
the material. The most important instance is that of vowel and con- 
sonant differentiation (see Chapter V, “ Classification Criteria ”). 

Present phonetic descriptions are a hybrid, being neither strict 
description by physiological structure and movement alone (with or 
without acoustic judgments added), nor statements designed solely to 
show systems of sound in speech (which should be the province of 

49 Noel-Armfield, chart facing 180. 

50 See Sapir, “ Glottalized Continuants in Navaho, Nootka and Kwakiutl 
(with a Note on Indo-European),” Lang., 14 (1938), 248-74. 

51 See Westermann and Ward, 100. 
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the phonemicist). The exclusion of whispered and voiceless vowels 
from vowel definition, the inconsistency of the boundary line between 
consonants and vowels, the inclusion of certain types of laterals but 
not others in phonetic charts, the relegation to a minor place of non- 
pulmonic sounds and of pharyngeal sounds, the exclusion of nonspeech 
sounds, all point to the fact that judgments of normality are judgments 
of phonemic usage, not of phonetic character, and as such they are 
to be deprecated. Not only speech values as a whole are allowed to 
affect the classification, but even certain very specialized speech values 
which are chosen as norms. 

Why does no language use exclusively whispered vowels? Why, 
do certain nonspeech sounds never occur in language? Why are some 
rare and others frequent? Rarity and frequency of occurrence of 
phonetic sound types as phonemes or variants of phonemes should be 
investigated by the phonemicist, but the phonetician is doing him no 
favor when he prejudges the conclusions by giving normalized classi- 
fications, and suppresses the very data that would provide the pho- 
nemic choice of sounds with a contrastive setting. 

Whenever a system of phonetic description is affected by one or 
more phonemic systems, it automatically and proportionately pre- 
vents the unbiased description of a new phonemic system (that is, a 
new language) , or of sounds in isolation apart from a phonemic system, 
or of sounds of a language before the phonemic system has been de- 
tected, or, for that matter, of any material for which a strict phonetic 
record is desired within the limits of the technic. 

The material for this volume was presented to phonetic discus- 
sion groups at Ann Arbor 52 in the summers of 1939 and 1940. Dur- 
ing the session of 1940 Dr. Bernard Bloch 53 called to my attention 
the fact that J. C. Catford had independently arrived at certain of 
these conclusions and published them in an excellent though short 
article 54 to which I had not had access. 

Catford also objects to keeping implosives, ejectives, and clicks 

52 The groups met at the Linguistic Institute (under the auspices of the 
Linguistic Society of America and the University of Michigan). 

53 Dr. Bloch is associated with the Linguistic Atlas of the United States and 
Canada. 

54 Catford, “ On the Classification of Stop Consonants,” Le Maitre Phone- 
tique, 3d Series, 65 (1939). 2 ~ 5 - 
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apart, “ as if they don’t enter into a general classification along with 
other consonants,” 55 and regards that procedure as pedagogically un- 
sound. 

Starting from the description of “ pulmonic,” “ glottalic,” and 
“ velaric ” clicks given by Beach , 56 he enlarges the classification to 
cover not only these “ suction stops ” but also “ pressure stops,” and 
does so with an admirably lucid terminology and systematic presenta- 
tion. He suggests extending the classification to fricatives , 57 and 
gives an illustration or two. 

His description of voiced implosives (glottalic suction stops) 58 is 
accurate and clear; later he describes voiced (velaric) clicks . 59 Plac- 
ing the two descriptions together, one finds he has not discussed 
various possibilities of combination of his three basic types of stops, 
and has failed completely to try to correlate the two descriptions into 
a single system of type combinations which would have suggested 
many other (speech and nonspeech) forms. Such a system should 
explain, for example, the two simultaneous but different functions of 
the velar closure during the voiced (velaric) click (the term “ inner 
closure,” used also by Bell , 60 obscures this difference), as well as de- 
limit the possible types of sounds of each mechanism which could be 
superimposed one on the other; in addition, the relationship of the 
functional parts of the air chambers involved should have been made 
clear. 

Catford’s classification is primarily for stops, and, in spite of his 
reference to fricatives, fails to show another basic point, that each 
mechanism for making sounds can produce just the same fricative, 
lateral, nasal, and vocalic types except for certain innate limitations 
(including, for example, lack of audibility in click vowels and impos- 
sibility of glottal stop with the air chamber of a [velaric] click). 

00 Ibid., 2. 

66 Beach, The Phonetics of the Hottentot Language. This work I did not 
see until 1940. 

57 Catford, op. cit. in Le Maitre Phonetique, 3d Series, 65 (1939), 2, 5. Fur- 
ther search has shown that Jones had earlier described fricatives (and stops, trills, 
and so on) of implosives and click types in “ Implosive and Click Sounds,” Le 
Maitre Phonetique, 3d Series, 22 (1907), m-14. 

58 Catford, op. cit. in Le Maitre Phonetique, 3d Series, 65 (1939), 4. 

59 Ibid., 4-5. 

60 Bell, 62. 
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Nor does Catford try to show that the pressure-rarefaction phe- 
nomena described in his three types of speech sounds are even more 
widely applicable, demonstrable as the basis for all nonspeech sounds 
except those at the halfway mark (see percussives, pp. 103-5) an d 
a further small residue (see p. 105, scrapives). 

This study attempts to present a more universal classification of 
marginal speech and nonspeech sounds than any now available. No 
effort has been made here to achieve extreme minuteness of detail in re- 
gard to point of articulation ; but, on the other hand, every sound which 
seems to demand a modification of present classifications, or which is 
a new type, has been given place regardless of its apparent lack of im- 
portance in communication. Since the line of demarcation between 
speech and nonspeech types is not sharp (see, however, pp. 149-51), 
many types actually analyzed are not specifically mentioned when 
they can be subsumed under the classifications as presented and il- 
lustrated. 

The auditory articulation technic (or, more briefly, the articula- 
tory technic ) has been employed for analysis; by this are meant de- 
scription in terms of movements and positions of the vocal organs and 
investigation primarily by the ear and the kinesthetic sense. When 
investigation of articulations utilizes instruments, the extended pro- 
cedure represents the instrumental articulatory technic. In contrast 
to these, auditory acoustic technic and instrumental acoustic technic 
analyze and describe the sounds (or sound waves) themselves rather 
than the movements producing them. The rest of this chapter will 
be devoted to a consideration of the limitations and advantages of 
each of these four technics in relation to phonetic classification. The 
first to be discussed will be the articidatory technic, as already defined 
in this paragraph. 

In phonetic literature 61 sounds are occasionally mentioned which, 
although the same or very similar, are produced by a sharply different 
action of the vocal mechanism, a difference readily seen by the eye 
or noted by the kinesthetic sense. The articulatory technic simply 
describes alternate means for the production of one sound. 

61 See Bloomfield, 108; Jones, Outline, 102, n. 6, 200 n.; Kenyon, 156; Rip- 
man, S3 ; Sweet, 29-30. 
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A different situation prevails when the articulatory variation is 
minute. Sweet’s statement that each new tongue position produces 
a new vowel, and hence that the number of vowel sounds is infinite, 62 
is only a partial truth. True, there may be a difference in the sound 
waves which instruments could record, but the ear is not infinitely 
delicate. For changes of all types the ear (with individual variation 
around a norm) has definite thresholds of discrimination beyond 
which it cannot go. This limits possibilities of discrimination between 
all sound elements, whether differences caused by minute changes of 
point of articulation or differences of prosodic factors of pitch, quan- 
tity, and stress. 63 The articulatory procedure, therefore, does not 
attempt to describe an infinite variety of sounds and articulatory po- 
sitions, but only those above the perceptual threshold; the number 
which can be perceptually discriminated is not infinite. 

The thresholds of discrimination are present even when sounds 
are contrasted one directly following the other. Any phonetician 
knows from practical experience that this power of discrimination is 
extraordinarily reduced if the sounds are not given contiguously; that 
is, if two very similar vowels are pronounced one after the other it 
may be relatively simple to say that they are different, but to identify 
which of the two appears in a sentence may be impossible. Persons 
without phonetic training come far short in their discriminatory pow- 
ers of that of which the ear is basically capable. Jones states that a 
good (i.e. expert) ear can distinguish well over fifty vowels (apart 
from nasalized or retroflex types and the like) ; 64 this number is con- 
siderably short of an infinite variety. 

This perceptual factor must be taken into account in any defini- 
tion of a phonetic unit (see below, pp. 45-46, 108-10) or else no two 
phoneticians can agree on procedure when at times they perceive dif- 

62 Sweet, 13. 

03 See Fletcher, Part III, Chapter II (“ Limits of Audition ”) , 143-66 (“ Mini- 
mum Perceptible Differences of Sound ”), 134 (Bibliography) ; idem, “ Loudness, 
Masking and Their Relations to the Hearing Process and the Problem of Noise 
Measurement,” Journ. Acoust. Soc. Am., 9 (1938), 273-93; Shohara, “An Ex- 
perimental Study of the Control of Pronunciation,” Speech Monographs, 6 (1939) , 
103-10. Dr. Shohara measured considerable deviation in pitch and especially in 
duration even when subjects tried to control their rate of speaking and to keep 
it steady. 

oi Jones, Outline, § 122. 



1 6 PHONETICS 

ferent entities. The factor reaches difficult proportions, since the 
perceptual element is highly variable and is conditioned by phonemic 
background . 65 

In contrast with the articulatory technic, which allows one to pro- 
duce unknown sounds from articulatory descriptions (in the manner 
of a cook book), a supplementary descriptive method, the imitation- 
label technic, merely gives names to known sounds. The latter places 
labels on standards otherwise established, whereas the former estab- 
lishes rough standards. The imitation-label technic may employ 
written “ descriptions ” of sounds. Here, however, the sound can- 
not be produced by following the description. Such descriptions are 
labels only; rather than serving as directions to produce the sound, 
they are merely convenient tags by which to recall to mind or to 
mention certain sounds previously learned by imitation following a 
demonstration. The label itself is unimportant ; any will serve, since 
its accuracy of description is entirely immaterial to its application. 
Demonstration (from the point of view of the teacher) and imitation 
(from the point of view of the pupil) are the all-important factors; 
the label merely serves to recall the transaction. Much language is 
learned by imitation (by listening to it spoken, or from phonograph 
records and the like), without labels being applied to the sounds. 
When labels are applied to sounds learned from imitation, with or 
without a feeling that the labels are articulatory descriptions, the 
imitation-label technic has been used. 

Sometimes extraordinary difficulty is encountered in determining 
which technic one is employing, since one can so easily be self-deceived 
into thinking that a mere label is an articulatory recipe. If a pur- 
ported description is actually an articulatory one, and not an imita- 
tion label only, other persons by following the directions given and 
fulfilling all the conditions of articulatory movement should be able 
to produce the sound described without having heard it. This test 
is to some extent conditioned by the ability of the persons attempt- 
ing to make the sound; for example, if they understand that the 
larynx needs to be lowered for a certain sound but do not know how 
to accomplish that movement, even a perfectly clear and accurate 
description of an implosive may not enable them to attain mastery 

65 Cf. the objection of Bloomfield (84-85) to phonetic rather than phonemic 
transcriptions. 
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of the sound — they first need instruction in methods of accomplish- 
ing this movement. 

I applied this criterion to Ganthony’s description of the “ Ventrilo- 
quial Drone,” in regard to which he says, “ The acquirement of the 
1 Drone ’ is the acquirement of all distant sounds.” 66 He directs one 
to take a deep breath, to hold it while making a “ retching sound at 
the back of the throat as though trying to be sick,” to utter a pro- 
longed “ ah,” exhaling slowly; this first will give a “ grunt,” then “ a 
drone,” a “ sustained clear hum.” 07 I tried to follow directions; re- 
sult: retching sound, grunt, hum could all be made (by imitation- 
label technic from memory of such sounds), but nothing emerged 
which sounded particularly distant. Presumably, his discussion was 
an imitation label, not an accurate articulatory description. It seems 
very probable that the articulatory factor which Ganthony was trying 
to describe was a specific kind of glottal tension, and, further, that 
voice teachers must train pupils in a diametrically opposite direction 
to relax and enlarge the throat. 

An interesting study might be made of deaf-mutes learning to 
speak. I attended several lectures of a class on lip reading to see 
what phonetic factors would emerge. Of course, books on the sub- 
ject 68 use the articulatory method. Teaching deaf-mutes to speak — 
no matter how imperfectly — is a decided achievement for such an ap- 
proach. The fact that deaf-mutes do not speak normally shows that 
for perfect success the articulatory technic needs to be combined with 
auditory imitation; the descriptions of the sounds would then be 
combination articulation-imitation labels. 

In an attempt to discover articulatory factors unknown to me 
and, if possible, to make a rough correlation between the terminology 
of voice training and that of my phonetic analysis, I attended a few 
beginners’ singing classes . 68 I had opportunity both to hear the in- 
structor criticize the students and to have her describe in her terms 
certain effects in singing that I deliberately produced by articulatory 

66 Ganthony, Practical Ventriloquism 3 , 13. 

67 Ganthony, 12-13. Bergen {How to Become a Ventriloquist, 23) has a 
significantly related description wherein he speaks of the ventriloquial voice as 
being a “pinched voice,” a “succession of grunts and groans,” with (23, 102) 
“ pressure on the vocal cords.” 

68 See Kinzie and Kinzie, Lip-Reading. 

69 The classes were held in Santa Ana Junior College. 
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methods with which I had previously experimented. Unfortunately 
I had too little time to make more than a few brief observations, but 
they are of interest in this connection. 

When (according to my analysis) I deliberately restricted my 
pharynx by contracting it, or moving the tongue backward, I was 
told I had “ failed to place the tone,” or “ was placing the tone at 
the back of my head ” (instead of “ between my eyes ”). Similar 
analysis was reported when I gave marked lip unrounding or low 
back-tongue positions ; more frequently these were called “ spreading 
the tone.” The instructor approved of my “ rounded tone ” when 
lips were rounded and vowels shaded toward high and front positions 
and when the back part of the tongue was raised (more than is my 
natural habit) as much as was consistent with producing the vowel 
concerned. A raucous sound made by raising the larynx, and with 
pharyngeal constriction marked especially by the approximation of 
the faucal pillars (rather than by the tongue approximating the back 
wall of the pharynx as in the pharyngeal-constriction type first men- 
tioned), was also “ spreading the tone.” A marked lowering of the 
larynx was “tightening” the throat. Failure to give proper oral 
rounding, fronting, and the like, as well as the slightest suspicion of 
glottal friction added to the voicing, was “ letting air out around the 
tone.” Any constriction of the pharynx or tension of the muscles 
around the vocal cords was immediately noted and reproved. A de- 
liberate lowering of the velum to cause excessive nasalization was due 
to the fact that I had “ closed the turbinates, the small entrances to 
the sinuses.” 

The pupils were told to “ put more [o]” in a sound to round it 
further. They were constantly instructed to “ think the tone; you 
can think it wherever you want to.” They were directed to “ think 
the tone higher ” to keep from flatting. 

One of the consequences which surprised me most centered about 
such suggestions. I had supposed that by following strict articulatory 
procedure, deliberately shifting tongue or larynx here and there to 
get certain effects, no such “ thinking ” or “ feeling ” would be the 
result. On the contrary, even in the short time in which I tried to 
apply the technic, certain of these feelings of “ placing the tone be- 
tween the eyes,” “ feeling nothing between the diaphragm and the 
eyes,” and the like, became quite real. 
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Presumably statements in textbooks such as those directing a per- 
son to do “ anything to bring the voice out of the throat ” 70 and to 
say so and so, “ trying to aim the voice toward the hard palate just 
above the upper teeth ” 71 (so as to “ help to produce a vibrant, ring- 
ing quality in the tone ”) are labels relating to relaxation and en- 
largement of the throat , 72 which in turn permit the feelings to be 
obtained. 

Since directions for beginners’ singing are accompanied by the 
living example of the teacher demonstrating both desired and unde- 
sired effects, while many of the apparently articulatory directions are 
innately impossible of prosecution, one draws the conclusion that 
music teachers use predominantly an imitation-label technic. 

Such a procedure is valid, in spite of the objection a phonetician 
may at first have to it. After all, results are the essential thing, and 
if an impressionistic description gives phonetically strange labels to 
the items involved, that is immaterial within the imitation-label pro- 
cedure provided students can be taught to sing. The discussion of the 
development of such feelings as have been mentioned above is a psy- 
chological problem with which I find myself unable to cope as a pho- 
netician. 

Even stronger arguments may be adduced for the validity of the 
imitation-label procedure, in music, because of the limitations of the 
articulatory technic itself. Not even the most ardent advocate of 
articulatory methods in phonetic instruction would attempt to teach 
a voice student to raise a note an octave by saying, “ Make the vocal 
cords open and close just twice as fast.” 

This signalizes one of the basic limitations of articulatory pho- 
netics. Prosody cannot be described satisfactorily by the articulatory 
technic. Relative acoustic judgments must be used instead. 

The question should be faced as to why voice instructors employ 
an articulatory approach for mouth formations of vowels and con- 
sonants as little as they appear to. They are using an articulatory 
method to a very minor extent when they give exercises for flexibility 

70 Karr, Your Speaking Voice, 154 (quoting Skinner). 

71 Karr, 132. 

72 Karr, 119-30. Since writing this material I have seen Bartholomew’s 
“ Physical Definition of Good Voice Quality in the Male Voice,” Journ. Acoust. 
Soc. Am., 6 (1931), 25-33; 7n this excellent article Bartholomew discusses addi- 
tional features. See also idem, Acoustics of Music, 139-59. 
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of the lips 73 and the like, but that hardly seems to exhaust its po- 
tential utility. Perhaps they are afraid that consciousness of mouth 
formations will eventually hinder the singing. That particular argu- 
ment does not appear unanswerable to a phonetician. He knows per- 
fectly well 74 that even a conscious, awkward, slow approach to a 
foreign language by conscious movements gains a closer approxima- 
tion to the native sounds, and ultimately a freer conversational style, 
than can ever be attained (by an adult) by pure imitation. In as 
much as learning to make the voice pleasant is (for most of us) es- 
sentially the same as learning the sounds of a foreign language, pho- 
neticians would not be unduly concerned at the consciousness, know- 
ing that this tends to disappear with usage. Much of it has to 
disappear, inevitably, since it is manifestly physically impossible to 
think of perhaps two movements or positions for each of, say, three 
sounds per syllable at five syllables per second . 75 The same would 
be true of singing. Although there is a possibility that a few of the 
total articulatory movements might hang over in the consciousness 
longer than others and cause considerable trouble, it appears very 
doubtful that such a residue should of itself cause abandonment of 
the technic. 

The real reason that voice teachers cannot use an articulatory 
approach for more of their problems is that none is provided by the 
phoneticians. The chief concern of the vocalist lies with the throat, 
and not with the mouth. Yet in all the literature I have seen hardly 
one single constructive pedagogical suggestion was given for making 
sounds in the pharynx , 70 for expanding or contracting the pharynx, 
releasing tension on vocal cords, and the like. No articulatory study 
of the throat remotely attempts to answer the needs of the vocalist. 
Why? 

With very rare exceptions (of which Arabic has a few), things 
happening in the pharynx do not change phonemes. The norms es- 
tablished by phoneticians have excluded from detailed investigation 
almost everything which does not represent central distinguishing fea- 
tures in phonemic systems at large. The vocalist, on the other hand, 

73 See Karr, 130. 

74 See Jones, Outline, § 25; Noel-Armfield, 1; Sweet, 2 7. 

75 See Jones, Outline, § 43 (for numbers of syllables per second). 

76 See n. 41. 
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is seldom concerned with phonemic values, perhaps preferring to 
eliminate some contrast rather than spoil a beautiful tone ; his concern 
is with nonphonemic factors almost exclusively, factors which have 
met with little or no discussion and for the teaching of which few 
pedagogical devices have been provided. Only when phoneticians 
take a larger view of their field and try to classify, study, and find 
ways of teaching all sounds, regardless of their “ practical ” value, 
only when they enter a realm a bit more like that of the pure science 
of the physicist, can they hope to discover data which eventually 
can provide vocal instructors with the procedures they need. Until 
then these instructors do well to retain tones “ made between the 
eyes.” 

Doke says, “ At the risk of appearing, to some, to be too pedanti- 
cally minute in this phonetic study, I have decided to record in all 
my examples every phonetic phenomenon which presents itself. I 
shall leave nothing to deduction. For example, it will be found that 
[n] before [z] always implies the presence of [d] between them; I 
shall write [ndz ] .” 77 He believes, therefore, that his description 
embodies a purely articulatory approach, with no mixture of imitation- 
label technic. 

This feeling of completeness is illusory, wherever found. In the 
[n] which he mentions, what is the position of the back part of the 
tongue? of the root of the tongue? of the epiglottis, the faucal pillars, 
pharyngeal wall? of the cheeks, the jaw, and entrance to the esopha- 
gus? how long was the sound held, and with what pitch and inten- 
sity? what type of transitional sounds did it have before and after 
it? Bloomfield well says of a phonetician, “Even his most ‘ exact ’ 
record is bound to ignore innumerable nondistinctive features of 
sound.” 78 

In proportion to his assumption of normal conditions, or his ignor- 
ing of certain moveable parts of the vocal apparatus, the phonetician 
inevitably uses imitation-label technic. This remains none the less 
true even when some parts of his description represent acceptable ar- 
ticulatory technic. As regards certain of the components, the descrip- 
tion (or letter of the alphabet) is an articulatory label; as regards 
others it serves simultaneously as an imitation label, demanding a 
standard to prevent the sound from being spoken with an “ accent.” 

77 Doke, 5. 78 Bloomfield, 84. 
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Sounds within the throat 79 are most often subsumed in this manner. 
Jespersen 80 states that a number of such articulatory minutiae prob- 
ably need not be considered in certain sounds; some of these find no 
place in his analphabetic system. 

Further evidence of imitation -label influence in phonetics appears 
in vowel description. The highest part of the tongue (or the closest) 
is generally chosen as the most useful classification peg . 81 That this 
is only a rough classification, for convenience, writers frankly admit, 
mentioning other factors which affect vowel quality too slightly (or 
nonphonemically) to be worth including. Such factors may be the 
position of the tongue tip , 82 the height of the jaw (which governs 
the amount of tongue recording on a palatogram ), 83 the slope of the 
tongue , 84 the cheeks , 85 and so on. Russell makes a justifiable objec- 
tion to the vowel-triangle SG theory, partly because of these and similar 
factors. Although some differences in vowel production might prove 
to be below the threshold of perception (and hence might appear to 
result in the “ same ” vowel, and be legitimately classed together for 
that reason), others may prove to differ in ways which are seldom 
phonemic, and these can conveniently be subsumed under the (only 
approximately accurate) articulatory labels which are applicable 
within limits of broad production, but which require that imitation 
be added for a reproduction which is more like the model. The vowel 

79 For references see n. 41. Certain of the qualities of resonance, such as 
“ metallic sound,” mentioned by Bloomfield (94-95), may prove to be caused by 
such factors. Cf. Muckey, 85; see also Passy, Sounds, 20-21, where lowering of 
the larynx is said to give sepulchral voice. Perhaps here also are some of the 
elements which make people’s voices different, so we can recognize them (Bloom- 
field, 76), so far as such individual characteristics are not caused by differences in 
the vocal mechanisms themselves. That many of these factors are not physio- 
logically predetermined is evidenced by the fact that one person can mimic 
another. See also n. 67. 

80 Jespersen, Articulations, 10. 

81 See Grandgent, German and English Sounds, 11 ; Jespersen, Articulations, 
18-19; Jones, Outline, §§48, 151; Kenyon, 64; Noel-Armfield, 31; Passy, 
Sounds, 57. 

82 Jones, Outline, § 124. 

83 Jespersen, Articulations, n; Jones, Outline, § 128. 

84 Sweet, 15-16. 

85 Jespersen, Articulations, 10; Kenyon, 65. 

86 Russell, Speech and Voice, 77-83; idem, The Vowel, 133-56, 245-81, 
282-304, 317-51, also Proceedings of the Second International Congress of 
Phonetic Sciences (1936), 198-205. 
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triangle, then, is seen to be a group of handy articulation-imitation 
labels; in common with the majority of other phonetic descriptions 
the most prominent and the roughly accurate adjustments find rep- 
resentation in articulatory statements, while the remainder are sub- 
sumed under the same tag in its secondary and simultaneous function 
for the imitation-label technic. 

Additional evidence that such labels are not purely articulatory 
is seen in statements by Jones: “ The value of the cardinal vowels 
cannot be learnt from written descriptions: they should be learnt 
by oral instruction from a teacher who knows them.” 87 “ The sound 
[d] is best acquired by imitation, while observing carefully the posi- 
tion of the lips .” 88 One of the functions of a teacher is “ to act as 
a model of pronunciation.” 89 

Noel-Armfield says that “ No verbal description of any sound 
met with in language will enable the student to reproduce it exactly.” 90 
Jones states excellently the relation of articulatory and imitation tech- 
nics for rougher and finer adjustments respectively . 01 

One might be tempted to conclude that all phonetic descriptions 
of sounds are arbitrary and invalid because certain features are in- 
cluded and others overlooked or subsumed in the label. Bloomfield 
objects to “ exact ” phonetic records as selecting details for descrip- 
tion by “ accidental and personal factors.” 92 His chief complaint 
(amply justified) in regard to a phonetic transcription is its incon- 
sistency, but he has “ no objection to a linguist’s describing all the 
acoustic features he can hear, provided he does not confuse these with 
the phonemic features.” 93 

I believe it is possible, however, to find a method of dividing a 
sequence of sounds into nonarbitrary segments which are independent 

87 Jones, Outline, § 140. 

88 Ibid., § 309. 

89 Ibid., §§ 45, 47. 

90 Noel-Armfield, author’s Preface. 

91 Jones, Outline, § 105: “This does not mean that the learner is expected 
to acquire vowels by ‘ simple imitation.’ On the contrary, he will find a knowl- 
edge of the organic formation of vowels of considerable use to him. But this 
knowledge is not in itself sufficient. The finer adjustments of the tongue have to 
be done by means of sensory control from the ear.” In addition to “ adjustments 
of the tongue,” Jones should have mentioned adjustments of other parts as well, 
such as the pharynx. 

92 Bloomfield, 84. 

93 Ibid. 
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of phonemes (see Chapter III and pp. 107-20) and then to choose, 
rationally, certain basic factors to be described for each of these 
sounds which would place it as a member of some general production 
type, giving a clue as to its air mechanisms and chambers, primary 
strictures, and so on (see pp. 151-56). Such a description would 
make no pretense at being complete, since it would be circumscribed 
by known limitations of the articulatory technic and by unavoidable 
perceptual variations. When the linguistically distinctive nature of 
certain sound features has been called to the attention of the phoneti- 
cian, he can add those items which have been omitted by design (since 
they are phonetic minutiae) or by oversight (due to lack of training 
in that particular item) if they are pertinent to the descriptive project 
in hand. The initial classification would be designed to provide a 
relatively stable starting point for descriptive studies of various kinds 
— not at all a complete statement of the entire gamut of articulatory 
processes involved — and give a means for the abstract phonetic com- 
parison of sounds in their major articulatory (not phonemic) struc- 
tural features. 

The articulatory technic and its analysis of marginal sounds are 
helpful in teaching. Students find it difficult to learn isolated facts 
and data which do not fit a system. The late Dr. Edward Sapir 
applied this principle to phonetics when he said that it was easier for 
a student to learn five hundred new sounds than five 94 (e.g. a student 
may find it difficult to make a voiceless lateral spirant, a “ Welsh [ 1 ] ,” 
if he has had no training in the unvoicing of known sounds, but if 
the student has acquired the general method of unvoicing any sound 
whatsoever, he can readily produce dozens of voiceless sounds which 
he has never heard, including the lateral continuant, because they fit 
a pattern). Many sounds of speech can only be seen in a system 
when they are compared with marginal and nonspeech sounds; an 
articulatory classification best answers this need. Even lectures or 
books dealing with the phonemics of a single language might well 
profit by such a brief orientation. 

Teachers of esophageal speech 95 to laryngectomized patients 

94 Dr. Sapir made the statement at Ann Arbor, in 1937, in a lecture to the 
Linguistic Institute. 

95 Cf. Jackson, “ The Voice after Direct Laryngoscopic Operations, Laryngo- 
fissure and Laryngectomy,” Arch, of Otolaryngol., 31 (1940), 23-36; Levin, 
“ Speech following Total Laryngectomy (without the Aid of the Mechanical 
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would benefit by a study of pharyngeal and oral air mechanisms, since 
they appear to be confused by these factors. On the one hand they 
deprecate any resort to “ buccal whispers ” 96 (which apparently 
come from these two mechanisms), whereas they seem to encourage 
beginners to use pharynx-air affricates in the first word that they are 
taught. 97 The more expert speakers do not employ these sounds for 
stops or affricates, but utilize esophageal air exclusively. 98 

The instrumental articulatory technic, the second of the four de- 
scriptive technics to be discussed, 99 is of especial value in measuring 
movements and positions to define precisely classifications presented 
by the auditory approach. Palatograms, 100 for example, clarify points 
of contact of the tongue in various sounds; X rays show the height 
and shape of tongue formation (primarily for vowels 101 ) ; and so on. 

The technic seems to be incapable of making a complete, signifi- 
cant classification independent of an auditory approach. For one 
thing, the strictures which it measures have varying functions in 
sound production (e.g. of causing vibrations; of controlling the 
direction of the air stream, as during nasal closure within oral sounds; 
of causing friction; and the like; see pp. 56-65, 129-36); measure- 
ment as such cannot classify these differences. 

A second factor is one already mentioned (p. 15) — the percep- 
tual thresholds of discrimination. Any classification of sounds as 
heard by the ear must take these thresholds into account, but instru- 
ments cannot find them without auditory checks. 

But the technic here can be of great service to auditory phonetics 


Larynx),” a paper read before the Northern Medical Society (Philadelphia, 
1939) ; Morrison and Fineman, “ Production of Pseudo-voice after Total Laryn- 
gectomy,” Trans. Am. Acad. Ophthalmol, and Otolaryngol., 41 (1936), 631-34; 
Schall, “ Psychology of Laryngectomized Patients,” Arch, of Otolaryngol., 28 
( 1 93S) , 581-84; Stetson, “ Esophageal Speech for Any Laryngectomized Patient,” 
Arch, of Otolaryngol., 26 (1937), 132-42; idem, “ Speech Movements in Action,” 
Trans. Am. Laryngol. Assn., 55 (1933), 29-42. 

96 See n. 95. 

97 See sound film prepared by Dr. N. M. Levin in collaboration with Dr. 
C. L. Jackson. 

98 See n. 97. 

99 The imitation-label technic (see pp. 16-18, 21-23) is not included in this 
group of four precisely because it does not describe items, but only labels them. 

100 See Jones, Outline, 79-81. 

101 See Noel-Armfield, 16-17; Russell, The Vowel; Parmenter and Trevino, 
“Vowel Positions as Shown by X-Rays,” Quart. Journ. Speech, 18 (1932), 351. 
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by investigating the minimum movement or change of position which 
the ear can register. If, for example, the instrumentalist had experi- 
mentally classified such thresholds for minute vowel change as were 
caused respectively by tip of tongue in various positions, by root 
and other parts of the tongue, wall of pharynx, faucal pillars, height 
of larynx, height of jaw, cheeks (or, if necessary, if he had classified 
thresholds for combinations of such movements), then X rays of 
slightly different vowels (which the auditory approach said were 
different but could not analyze satisfactorily) could perhaps deter- 
mine which changes of position caused the vowel changes and which 
could not have done so because they were below the perceptual level. 
Such investigations would help classify many items now subsumed by 
imitation-label technic under auditory articulation labels. 

Furthermore, if, on the one hand, such perceptual thresholds of 
movement were classified, while a technic was provided for recording 
them, and, on the other, a suitable working theory were available for 
segmentation of strictural changes in a sequence, the instrumentalist 
could cut any continuum into perceptual phonetic units. For the need 
and possibility of such a postulation see Chapter III and pages 107-20. 

In prosody, where the articulatory technic is weak, instrumental 
technic can make a considerable contribution. It can measure chest 
pulses 102 (for stress), photograph and time the movements of the 
vocal cords 103 (for pitch), and measure the length of time certain 
strictures are retained (for quantity). At present this can be done 
only imperfectly and within strict limits upon the sounds the technic 
can investigate (photographing the vocal cords, for example, seems 
for the present to be limited to certain vowels) . 

One might try to classify sounds by the auditory acoustic technic, 
the third descriptive method, and give labels to different types of 
tone quality as perceived by the ear. I have seen no attempt to do 
this systematically, and I personally can imagine no such terminology 
which would be as convenient as the articulatory type; it would prob- 
ably develop almost strictly into an imitation-label technic (since 
the standards of sound which the labels represented would have to 

102 See Haden, The Physiology of French Consonant Changes (Supplement 
to Language, Dissertation 26), 19; Stetson, “Speech Movements in Action,” 
Trans. Am. Laryngol. Assn., 53 (1933), 29-42. 

103 See Herriott, “ High Speed Motion Picture Photography,” Bell System 
Tech. Journ., 17 (1938), 393-405. 
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be established almost exclusively by samples, and the necessity of 
samples ultimately implies imitation for their production). 

Jespersen’s statement fifty years ago still seems applicable, “ Not 
even the most ardent adherents of the ‘ acoustic ’ point of view have 
ever tried to base their phonetic terminology or any system of notation 
on the acoustic properties of those sounds.” 104 The acoustic criterion 
of the presence or the absence of audible friction, however, is one of 
the most basic to phonetics, especially in consonant-vowel distinc- 
tions (see p. 70-72) and gives a valuable supplement to articulatory 
descriptions. Sonority likewise has had influence on phonetic classi- 
fications (see pp. 69-70). Harris has mentioned that differences 
exist in consonant and vowel treatment based on articulatory versus 
acoustic descriptions. 105 

The auditory acoustic technic fills an exceedingly important place 
in the linguistic analysis of prosody; neither of the articulatory tech- 
nics can handle the problem satisfactorily. The untrained person 
makes judgments of pitch, stress, and quantity by auditory acoustics. 
His are judgments relative to a vague standard or norm for a par- 
ticular individual or community (“a low, soft, drawling voice”). 

The trained musician can estimate much closer to fixed pitches 
or quantities. As with segmental sounds, presumably many more 
pitch differentiations can be perceived when contiguous (Fletcher 
states that there may be up to 2000 perceptible gradations from 
highest to lowest audible pitch 106 ) than when noncontiguous. 107 Even 
when a subject attempts to retain pitch and quantity at certain levels 
he is unable to do so; variation occurs, especially in quantity. 108 
Presumably much of this variation is below the threshold of percep- 
tion. 

The phonemicist makes relative acoustic judgments as regards 
contiguous or noncontiguous sounds in a phrase. The ear of a native 
speaker of a tone language registers this type of acoustic data (tones 
are “ high ” or “ low ” relative to each other, rather than to an 

104 Jespersen, Articulations, 3. 

105 Harris, Review of “ Foundations of Language,” by L. H. Gray, Lang., 
16 (1940), 229. 

106 Fletcher, Speech and Hearing, 162. 

107 The violinist, for example, in tuning his instrument immediately knows 
whether he is correct if the piano is struck, but he might be in error if he tried 
to tune merely by auditory memory of the note. 

108 Cf. Shohara. 
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absolute pitch ; changing the key, i.e. all the pitches, does not change 
tone; a single, level, isolated tone is not subject to classification — it 
must be put next to others to see if it is relatively low or high, and 
so on) , 109 Similarly, the musician can sing “ do, mi, sol, do ” in any 
key or octave, but retains the same intervals; speech differs primarily 
in that linguistic pitch intervals are not strictly fixed, but vary freely 
within certain limits or under certain fixed conditions, and the whole 
key is readily changed between (or even in the middle of) phrases. 
Some African tone languages transfer their somewhat flexible tone 
intervals to musical instruments of a few fixed notes (pipes, drums, 
and the like), for signalling . 110 It isn’t too clear why persons who 
readily “ carry a tune ” of a dozen or more relative pitches still find 
it so difficult to learn a tone language of three or four. 

The instrumental acoustic technic, the fourth descriptive method, 
at present seems to make one of its greatest contributions in analyzing 
prosody. Its accuracy goes infinitely beyond that of the auditory 
approach. Interesting studies can thus be made of intonation and 
prosodic factors which the auditory articulation technic cannot touch 
(e.g. of certain emotional expressive states ). 111 Right here lies its 
greatest danger; whatever is more refined than the ear can record 
becomes unsuitable for a phonetic classification. 

Apart from this phonetic difficulty, linguistic studies suffer by 
overdifferentiation of tone in tone languages, where tonemic studies 
(i.e. phonemic statements of tone) are the only ones pertinent to lin- 
guistic structure. This may be the case even when phonographs and 
similar devices are used to supplement auditory acoustic analysis. 
Doke, for example, used a dictaphone for “ careful analysis of sound 
and tone, when reproduced repeatedly.” 112 Result: nine registers, 


109 For brief suggestions for control of the variable factor in analysis of 
prosodic systems see Pike, Phonemic Work Sheet. 

110 See Von Hornbostel, “African Negro Music,” Africa, i (1928), 30-62. 
For linguistic tone affected by use of words in singing see Herzog, “ Speech Melody 
and Primitive Music,” Musical Quarterly, 20 (1934), 452-66. 

111 Expression of fear, anger, love, hate, and other emotions is, apart from 
a few interjections, of a prosodic type superimposed on speech. As such it is not 
readily susceptible to auditory articulation analysis. For an excellent instrumental 
study see Fairbanks and Pronovost, “ An Experimental Study of the Pitch Char- 
acteristics of the Voice during the Expression of the Emotion^,” Speech Mono- 
graphs, 6 (1939), 87-104. 

112 Doke, 8. 
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which were nonphonemic or so unimportant that he was able to give 
permission of Westermann and Ward 113 to reduce the number greatly 
for a rough translation into their system of tone marking. Had each 
of these registers been phonemic, this reduction would be comparable 
to eliminating [p], [t],and [k] from English for a like orthographical 
reason. 

Beach, working with similar means, says, “ As far as height of 
tone is concerned, most writers on intonation have contented them- 
selves with three degrees, which they call high, low, mid. Such sim- 
plicity is probably both adequate and desirable in any tonetic system 
consisting of only a few tones. I shall find it sufficient in my descrip- 
tion of the inherent tone of strong roots in Hottentot. But it is alto- 
gether inadequate if one wishes to show the exact intonation of a 
sizable piece of connected speech.” 114 

At the same time that such methods add too many tones, they 
tend to fix utterances in such a way that the normal free variation of 
the individual is lost . 115 This hinders phonemic analysis, since two 
levels of pitch which are everywhere freely interchangeable are to be 
equated as phonemically identical. When one repeatedly plays back 
a recording he hears none of this variation which would normally 
be present. 

A similar danger exists when the acoustician studies, in segmental 
phonemes, differences that are below the perceptual threshold. 
Twaddell criticizes Bloomfield’s phonemic discussion by saying: 
“ That we do not find any such constant characteristic factor is of 
course a commonplace of experimental phonetics.” 116 This very com- 
monplace becomes dangerous when it unwarrantably implies or states 
that no two sounds are alike. For the ear they must be alike if the 
ear itself is physiologically incapable of telling them apart. 

The instrumental acoustic technic can be of great value in finding 
perceptual thresholds of the ear in response to differing sound waves. 
This should aid in establishing a basis for the limitation of refinement 

113 Westermann and Ward, 202 n. 

114 Beach, 126. 

115 Beach (150-77) uses four recordings of the same material for analysis, 
with three different speakers, one of whom gives the story twice. This is decidedly 
helpful but fails to avoid the basic dangers. 

118 Twaddell, On Defining the Phoneme ( Language Monographs, Vol. 
16), 25 - 
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of phonetic units (see p. 45) or alphabets; it is useless, for example, 
for an observer to use an alphabet more detailed than sounds he can 
hear, since there will be nonsystematic and nonsignificant variation 
in choice of symbols. Interesting studies can be made of acoustic 
phenomena which occur below the perceptual level, and even sub- 
jective phenomena (things “ heard ” which are not in the sound waves) 
can be added to objective data (e.g. a subjective intensity vibrato can 
be added to an objective frequency vibrato 117 ). 

Presumably the acoustician will ultimately have his own basic 
segmental units of sound. Actually, he seems to start with an articu- 
latory unit. 118 He lacks also a significant classification for the study 
of linguistic structure and any system of phonetic transcription. 
Speaking of present limitations of the instrumental approach, Bloom- 
field says, “ In fact, the laboratory phonetician usually knows, from 
other sources, the phonemic character of the speech-sounds he is study- 
ing ; he usually formulates his problems not in purely acoustic terms, 
but rather in terms which he has borrowed from practical pho- 
netics.” 119 

Not all types of possible vocal sounds occur in speech. The 
acoustician is obviously limited to material which he finds in speech 
or interjections of various kinds; if he wishes to postulate sounds 
with which he has no contact, in order to contrast total possibilities 
with actual sounds of speech, he must postulate and produce them 
by articulatory analogies. Furthermore, if such limitations in the 
total of speech sounds prove to be due to articulatory factors, as 
seems likely (e.g. belches are not woven into normal speech presum- 
ably because of the articulatory difficulty of getting air with sufficient 
ease and speed to fix them into a stream of speech), then he could 
never discover this fact by a study of sound waves of speech. 

117 See Kock, “ Certain Subjective Phenomena Accompanying a Frequency 
Vibrato,” Journ. Acoust. Soc. Am., 8 (1936), 23-25. See also Snow, “ Change of 
Pitch with Loudness at Low Frequencies,” Journ. Acoust. Soc. Am., 8 (1936), 
14-19. Snow found that with certain low sounds, the louder they were made, the 
lower the pitch seemed to the observers (sometimes as much as fifty per cent 
lower), even though the pitch actually remained stationary. Observers had 
general, but not perfect, agreement. 

118 See de Groot, “ Instrumental Phonetics. Its Value for Linguists,” K. 
Akademie van Wetenschappen, Afdeelingen Letterkunde, M ededeelingen, 65, A. 
2 (1928), 48. 

119 Bloomfield, 85. 
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No classification of sound waves at present remotely approaches 
articulatory technic in pedagogical aids for the teaching of sounds. 

Phonograph records can substitute for the instructor in phonetics 
in his capacity as a model , 120 where the imitation-label technic over- 
laps the articulatory one. Perhaps photographs of waves from a 
cathode ray tube may sometime serve a similar purpose as a test of 
the success of imitation. 

In this chapter we have seen that phonetic systems have not been 
based upon the total number of sounds which are known to occur in 
speech. Unless consonant sounds are made with air coming out from 
the lungs and vowels are voiced, sounds fail to enter the main stream 
of phonetic inquiry and delineation, but are shunted into an obscure 
backwash of classification. For the investigation of these marginal 
types the writer has chosen to use an auditory analysis with descrip- 
tion in terms of articulatory movements supplemented by a few 
acoustic criteria. 

Auditory analysis is essential to phonetic study since the ear can 
register all those features of sound waves, and only those features, 
which are above the threshold of audibility and therefore available to 
any speech community, whereas analysis by instruments must always 
be checked against auditory reaction because it has no criterion apart 
from judgments of the ear to indicate what movements or features 
of sound waves are below the threshold of perception. Description 
based on movements of the vocal apparatus, even though supple- 
mented by acoustic terms, is more convenient than description ren- 
dered entirely by means of auditory acoustic judgments, since the lat- 
ter lacks sufficient points of reference which can be defined without the 
necessity of establishing them in relation to standards that can be 
duplicated only by imitation. In the following chapter further sound 
types will be discussed, ones which find even less place in phonetic 
literature than do the marginal sounds mentioned thus far. 

120 See Jones, Outline, §§ 47, 48. 



CHAPTER II 


NONSPEECH SOUNDS 

N ONSPEECH sounds 1 as produced by the human vocal appara- 
tus 2 are mentioned in books on phonetics in various connec- 
tions. In none have they been given treatment to indicate their 
importance for and bearing upon phonetic theory of speech or the clas- 
sification of vocal sounds as a whole. This can hardly cause surprise 
since they are one step further removed from the speech norms that 
have been established (pp. 5-12) than are the marginal sounds, which 
have also been neglected. Their chief usage is for illustration of 
various types of sounds. 

When an author discusses a sound type for which he finds no illus- 
tration in the phonemic norms of language, or which is used only in 
the phonemic norms of languages far removed from those with which 
his readers are likely to be acquainted, he may illustrate it with a 
nonspeech sound. Types requiring an inverse air stream (suction 
sounds), whether sounds inbreathed to the lungs or inverse to the 
mouth, may be illustrated by sounds of pain, 3 drinking, 4 pleasure, 6 

1 The term is subject to flexible usage in this paper, without strict delinea- 
tion. It may include sounds which do not occur in languages known to a par- 
ticular author, sounds produced in cultural contexts other than speech (even 
if similar sounds occur in language) , or even sounds thought to occur in speech 
only as rare variants of phonemes. 

2 No animal calls or the like are considered. Negus ( The Mechanism of the 
Larynx) has interesting descriptions of the sound-producing mechanisms of a 
great many animals; see especially his Appendix No. 2. Neither are any possible 
parallels of human to animal sounds considered. The naive thesis of Luthy (in 
his book The Human Speech Sounds, 1) will hardly find acceptance; five English 
vowels (which are also “ common to the voices of all mankind — from Hottentot 
to German ”) are said to be the fundamental sounds of the universe (and from 
these he develops his theory of consonants) as “ proved ” by the fact that they 
are identical with the vowels of “ [meaaou],” “as uttered by the cat” in this 
“ natural order.” 

3 See Catford, “ On the Classification of Stop Consonants,” Le Maitre 
Phonetique, 3d Series, 65 (1939), 5 ; Noel-Armfield, General Phonetics 4 , 120; 
Passy, The Sounds of the French Language, 86; Sweet, A Primer of Pho- 
netics 3 , 43. 

4 See Sweet, 43. 

5 See Noel-Armfield, 120; Passy, 86. 

32 
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kissing , 6 surprised commiseration , 7 impatience , 8 or by a signal to urge 
horses , 9 and the like. 

Types of movements may likewise be demonstrated even if un- 
known in speech as such; lip trills for guiding horses are an example . 10 

Nonspeech sounds are sometimes employed as analogies in describ- 
ing well-known speech sounds. A cough is frequently used to illus- 
trate a glottal stop; 11 the sound made by blowing out a candle, to 
illustrate some type of bilabial voiceless spirant; 12 a puff of breath, 
for aspiration . 13 A rarer expedient is a sound of surprise to demon- 
strate increasing force of breath pulse . 14 

A few nonspeech sounds are used as approaches to exercise for 
the attainment of some phonetic skill; this type may be seen in the 
practice of yawning to gain control of the velum . 15 

Occasionally abnormal conditions of the speech mechanism are 
mentioned and illustrated by resultant sounds — bronchial rattling, 
and snorting or sniffling , 16 for example. 

Apart from exemplifying speech sounds by showing the manner 
of their production, nonspeech sounds may be used to clarify the 
application of a phonetic alphabet. Bell, to demonstrate his “ visible 
speech,” listed and wrote with his symbols many nonspeech sounds, 
many more than can be found elsewhere. From his long list 17 one 
may mention the “ interjectional or inarticulate utterances ” of sigh- 
ing, panting, fluttering, shuddering, sobbing; the sneer, yawn, gasp, 
hiccough, pang, moan; the murmur of ridicule, vexation, disgust; and 
so on. 

6 See Passy, 87; Sweet, 43. 

7 See Bloomfield, Language, 94. 

8 See Passy, 87. 

8 See Bloomfield, 94; Noel-Armfield, 12 1; Passy, Sounds, 87. 

10 See Noel-Armfield, 8. 

11 See Forchhammer, How to Learn Danish 4 , 1; Jones, An Outline of Eng- 
lish Phonetics 4 , 534; Ripman, Elements of Phonetics, 6; Sweet, 12; Vietor, 
German Pronunciation, 39. 

12 See Jones, § 683; Kenyon, American Pronunciation 6 , 126; Noel-Armfield, 
86; Westermann and Ward, Practical Phonetics for Students of African Lan- 
guages, 18. 

13 See Bloomfield, 82; Jones, §497; Jones and Woo, A Cantonese Phonetic 
Reader, xi; Kenyon, 32; Noel-Armfield, 39. 

14 See Sweet, 48. 

15 See Westermann and Ward, 16. 

16 See Noel-Armfield, 3. 

17 Bell, Visible Speech (Inaugural Ed.), 30. 
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Two writers suggest several nonspeech sounds which their students 
are to use for practicing analysis of sounds and their transcription. 
Noel-Armfield’s “ peculiar noises ” 18 and Krapp’s “ sounds not used 
in articulate speech ” 19 are quite similar to each other and to part of 
Bell’s list. 

Krapp 20 and Noel-Armfield 21 employ a few nonspeech sounds 
to show a contrast with speech sounds. Sapir 22 has a discussion of 
the psychological and functional difference between the sound of 
candle blowing and the speech unit [hw] . Bloomfield mentions the 
difference in the interpretation of the sound of glottal stop by speak- 
ers of a language in which it is a phoneme and by foreign listeners who 
are trained not to respond to a “ catch in the throat.” 23 

Passy has a rare type of entry for a nonspeech sound when he 
includes whistles among “ accessory ” sounds because they are “ some- 
times used to convey meaning.” 24 Other vocal expressive devices 
he ignores. 

Many writers , 25 especially those describing the phonetics of single 
languages, make no use of nonspeech sounds. 

Klinghardt 26 emphasizes the importance of nonspeech sounds for 
phonetic theory. Unfortunately, he devotes attention to but a few 
such sounds (though he does discuss some nonspeech movements 
which illustrate the mechanism of certain sounds), and they do not 
lead him to depart in any marked degree from the customary type of 
classification. 

Phoneticians, failing to recognize the broadness of the field which 
is legitimately and profitably theirs, usually start with the assumption 

18 Noel-Armfield, 180 (kiss, sniff, snort, hiccough) . 

19 Krapp, The Pronunciation of Standard English in America, 143 (sigh, 
cough, cluck, click, sniff, ‘ hmph,’ ‘ huh,’ ‘ eh,’ ‘ hm,’ the sound of calling a cat, 
and the sound for starting horses). 

20 Krapp, 2. 

21 Noel-Armfield, 3. 

22 Sapir, “ Sound Patterns in Language,” Lang., 1 (1925), 37-51. 

23 Bloomfield, 82. 

24 Passy, 88. 

25 Gairdner, The Phonetics of Arabic; Karlgren, A Mandarin Phonetic 
Reader ( Archives d’Etudes Orientates, Vol. 13) ; Nicholson, A Practical Introduc- 
tion to French Phonetics; Stirling, The Pronunciation of Spanish; Lowie, “ Hidatsa 
Texts . . . ; with Grammatical Notes and Phonograph Transcriptions by Zellig 
Harris and C. F. Voegelin,” Prehistory Research Series, I, No. 6 (1939), 183-84. 

26 Klinghardt, Artikulations- und Horiibungen. 
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that it does not include sounds apart from speech. The first sentence 
of Sweet’s book states, “ Phonetics is the science of speech-sounds.” 27 
Jones’s Outline of English Phonetics begins: “ The Nature of Speech. 
Spoken language consists of . . . .” 28 Passy says of speech, “ The 
study of these sounds constitutes a science which we call phonetics. ” 29 
Noel-Armfield appears to take a broader view when he remarks that 
phonetics “ may be described as the science which deals with the 
gymnastics of the organs of speech, both in theory and in practice.” 30 
Practically, however, he abandons this position immediately, in so far 
as his own investigation is concerned, when he rules out of considera- 
tion various items which “ cannot be classed as speech-sounds.” 31 

Perhaps the reason for this assumption can be discovered in Sapir’s 
discussion, which shows that sounds of phonetic systems function 
differently from similar sounds apart from such systems. 32 In regard 
to candle blowing as compared with [hw] he notes that the one has a 
separate function in the business of blowing, while the other is good 
only as a symbol in speech, with no meaning otherwise; the one can 
be varied to [ J] or [x], and so on (but may never be voiced), or 
have great variation in intensity, while the other has variants to 
voicing, [w], and is not so strongly articulated; the one has no asso- 
ciations and is not in a system, while the other is associated with other 
sounds in words and is one of a limited series of other sounds from 
which it essentially stands aloof. 

This describes the speech sound according to its function; the 
phonetician should be able to describe a sound regardless of its func- 
tion (and hence when it is not in speech) ; otherwise his phonemic 
description of one language essentially influences that of another, or 
sounds of phonemic systems are described in part by functional fea- 
tures which are essentially variable (as parts of differing systems), 
rather than by a more stable articulatory phonetic procedure. 

The decision that a sound is to be included in general phonetic 
charts or classifications often rests upon the fact that it occurs fre- 
quently as a phonemic norm in speech, or as a variant of a norm ; if it 

27 Sweet, 1. 

28 Jones, 1. 

29 Passy, Sounds, 5. 

30 Noel-Armfield, 1. 

31 Noel-Armfield, 6. 

32 Sapir, op. cit. in Lang., 1 (1925), 37-51. 
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fails to find such usage, a sound tends to be omitted. The classifica- 
tion of laterals illustrates this. Never have I found a chart which 
included bilabial lateral sonorants as one of the basic types (although 
[f] is sometimes principally a lateral, actually occurring in speech 33 ). 
Jones’s handling of varieties of English [s] 34 demonstrates how non- 
phonemic variants are given a minor place, even when such sounds 
are known to appear. Because nonphonemic variants and nonspeech 
sounds are thus ignored or slighted, supposedly pure general phonetic 
classifications are actually cut down and tailored to fit an abstraction 
of phonemic patterns. 

Unless the phonetician can predict that certain types will never 
occur in speech, he cannot afford to overlook them, lest his classifica- 
tion be incomplete and inaccurate; but he cannot make such a predic- 
tion until he has studied them — and perhaps not even then. No one 
seems to have attempted to work out a system of nonspeech sounds 
as distinct from speech types with this prediction in mind. 

Perhaps such a study would show that certain types of nonspeech 
sounds are potential speech sounds, as evidenced by the fact that 
scattered members of related types have been found in language, 
while others are presumably ultimate nonspeech sounds, since no 
approximation of their general mechanism has ever come to light 
either as phonemic norm or variant. 

Instead of making a study to determine the relationship, investi- 
gators appear to have postulated a sharp division between speech 
and nonspeech sounds, on insufficient evidence. Noel-Armfield men- 
tions the difference, with a criterion which cannot be consistently 
applied. Speech sounds for him are “ voluntary,” and intentional. 
One of his illustrations of involuntary sound is the snore . 35 Yet what 
if the phonetician, or a boy in fun, voluntarily and intentionally pro- 
duces a snore or some other nonspeech type such as a belch, or cough, 
or sigh — does that make it a speech sound when it does not occur 
in speech? Krapp says that breath expelled forcibly through the 
nose is “ of course not an articulate speech sound. Articulate speech 

33 See Jones, § 693. 

34 Jones, § 709 and n. 

35 Noel-Armfield, 6. Noel-Armfield’s other illustrations (5), taken from 
abnormal physical conditions (a cold producing bronchial rattle, sniffing, snort- 
ing), merely obscure the issue, which is concerned with sounds produced by 
persons in normal health. 
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sounds are only those sounds which are articulated, or joined to other 
sounds in formation of sound groups or words .” 36 Yet sounds very 
similar to this, voiceless nasal spirants, occur as conditioned variants 
of [h] before nasals in Totonaco, Mazateco, Tlapaneco , 37 and many 
other languages. 

If no criterion can ultimately be found of the difference between 
types of speech sounds and nonspeech sounds, then a phonetic classi- 
fication must include them all. Otherwise, today’s nonspeech sounds 
are potential speech sounds, and may tomorrow be found in some 
language to upset the incomplete classification. Sounds which are 
nonspeech from the point of view of one language may be phonemic 
in another (or even in the same one, as Sapir’s illustration shows), 
and demand articulatory description. 

Perhaps every phonetician has had an experience similar to mine, 
of postulating certain sounds, thinking they would never occur in 
speech, only to discover that they do so. I have found, for example, 
occasional bilateral bilabial sonorants as nonphonemic variants of the 
vowel [u] in Mixteco . 38 Such a discovery is even more surprising 
when the sound proves to be the phonemic norm. 

In the earlier days of phonetic theory there was no opportunity to 
separate phonetics strictly from phonemics, simply because the latter 
was not understood. However, in spite of the development of pho- 
nemic study, which should presumably have caused the reexamination 
of phonetic data, classifications remain essentially unchanged. 
Neither does there seem to be any discussion except Bell’s where 
the author attempts to fit all nonspeech sounds known to him into 
his phonetic system. Those nonspeech sounds used for demonstrating 
speech sounds serve as pedagogical aids in the same way as do steam 
cylinders , 39 musical instruments , 40 the drawing of a cork from a 
bottle , 41 the use of a blowpipe , 42 and so on. Their purpose is illus- 
trative, not analytical; the system remains the same as if they had 
been left unmentioned. 

36 Krapp, 2. 

37 Indian languages of Mexico. 

38 An Indian language of Mexico. 

39 See Catford, op. cit. in Le Maitre Phonetique, 3d Series, 63 (1939), 4. 

40 See D. C. Miller, The Science oj Musical Sounds, 23; Passy, Sounds, 2-3. 

41 See Miller, 22. 

43 See Bell, 62. 
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Even Bell’s usage of nonspeech sounds was due more to a desire 
to show the universal application of his alphabet to speech than to a 
conviction that nonspeech sounds were basic to an understanding of 
phonetics. Although Bell’s work is now over seventy years old, his 
list represents a more complete analysis than any other I have seen. 
Unfortunately, though he was reaching after the understanding of 
many types of sounds, his failure to grasp numerous basic features, 
such as the mechanism of clicks, glottalized sounds (ejectives), and 
others, together with the fact that his list is very incomplete and 
utterly without organization, makes the work of little value for new 
study. 

Despite its being true that some sounds probably never have 
occurred nor will occur as phonemic norms, the phonetician inter- 
ested in speech should not ignore them. They may provide the con- 
trasting element which aids the appreciation of the highly selective 
nature of the majority of speech types. 

Many phonetic factors (compare “ Strictural Function,” pp. 56- 
65) have passed without sufficient comment simply because incom- 
plete data have hindered phoneticians from discovering the bias which 
those very data have engendered. 

The discovery of even a few basically new mechanisms could 
produce thousands of shades of sound when they had been expanded 
in the light of present phonetic theory; additional thousands can be 
made by recombinations of elements already well known. The num- 
ber of possible nonspeech sounds, as far as significantly new types are 
concerned, however, is not so limitless as might at first appear, 43 and 
any statement of the impracticability of studying them based upon 
such a premise proves to be invalid. 

Makers of phonetic alphabets, apart from Bell, have chosen to 
limit their field to sound types that occur in speech. The Interna- 
tional Phonetic Association has included only speech sounds in its 
alphabet. 44 Even Bloch and Trager, though they make certain 
marked advances in classification of sounds as a basis for their sym- 
bols (especially in regard to nonpulmonic mechanisms, following Cat- 
ford) have seen fit to do likewise. Evidently, to them, nonspeech 

43 Cf. Krapp, 2. 

44 The Principles of the International Phonetic Association, Supplement to 
Le Maitre Phonetique (1912) ; with chart revised to 1932. 
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sounds are not included in “ phonetic material,” since, even though 
they say that their system “ is believed to be all-inclusive ” 45 and “ is 
intended to provide the widest possible range of devices for the record- 
ing of phonetic material,” 46 no provision seems to be made for many 
sounds well known to them, some of which actually occur in types 
of communication. No devices are proposed which would, for ex- 
ample, give the full phonetic representation of the speech of a high 
school boy who whistles to get the attention of a friend, and replies 
with a “ Bronx cheer ” (the type which uses oral pressure and lip 
vibration) when the wrong person answers. Nor is there a basic sign 
to modify their symbols so they could record belched sounds (or the 
speech of a laryngectomized patient speaking with an air stream initi- 
ating in the esophagus rather than the lungs). When such sounds 
occur in connected speech they sometimes carry important social 
connotations , 47 and need symbols if complete representation is to be 
given. 

As a background for this study I searched for types of nonspeech 
sounds in the phonetics books (whose material I have reviewed above) 
and other sources. In books on voice I found no discussion of non- 
speech sounds, although Bender and Kleinfeld had in their exercises 
a number of nonspeech items . 48 Their analysis of sounds, however, 
cannot be trusted (witness their description of English [k], [g], and 
[n] as “palatal clicks” 49 ), and their exercises are often propor- 
tionately unworkable. 

A few helpful analyses of sounds resulting from some physical 
condition (hiccough, sneeze, cough, and the like) were found in medi- 
cal literature . 50 The most important of such material was a group of 
articles on esophageal speech , 51 which might be considered a border 

45 Bloch and Trager, Tables for a System of Phonetic Description (Prelimi- 
nary Ed.), 2. 

46 Bloch and Trager, 3. 

47 Says the host: “ We (yawn) have had a lovely (yawn) evening.” 
“(Burp),” replies the guest, “where’s (hie) my hat?” 

48 Bender and Kleinfeld, Speech Correction Manual, Containing 317 Prac- 
tical Drills for Speech and Voice Improvement. 

49 Bender and Kleinfeld, 205. 

50 Cunningham, Text-book of Anatomy 17 ; Gray, Anatomy of the Human 
Body 23 ; Howell, A Textbook of Physiology 14 ; Kimber, Gray, and Stackpole, 
Textbook of Anatomy and Physiology 10 ; Negus; Stevens and Davis, Hearing; 
Travis, Speech Pathology. 

51 See pp. 24-25, n. 95. 
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line between speech and nonspeech types. Sounds which the normal, 
well person is physiologically incapable of producing I have not in- 
cluded in the classification in this text, nor have I investigated many 
sounds produced from abnormal conditions. 

Other sounds were collected from scattered sources. Bell’s list 
had most of those resulting from mental states . 53 A group of phonetic 
students were invited to make contributions for analysis. The sounds 
of a deaf-mute added an item or two. A young baby’s babbling failed 
to offer more than one or two hints. A few types were obtained by 
trying to parallel by the voice sound-producing processes occurring 
in nature. A much more important source was a collection of my own 
sounds, recalled from earlier days, when they had been made in the 
carefree fashion mentioned by Sapir . 63 

The most important source of all was the deliberate effort to pro- 
duce sounds postulated by analogy with known speech sounds, start- 
ing both from speech sounds and from clues given by nonspeech 
sounds; especial attention was given to speech sounds other than 
those produced with an egressive air stream from the lungs. One type 
which I am still trying to obtain, for example, is an implosive stop 
(suction created by lowering the larynx) voiced with ingressive rather 
than egressive air from the lungs. 

The analysis of these sounds was by auditory articulation technic 
almost exclusively, although I took a few kymographic tracings which 
substantiated certain of my findings, and investigated some pharyn- 
geal sounds by means of a dentist’s mirror. On points where the 
auditory technic would not serve, I consulted various works on instru- 
mental phonetics , 54 though these rarely concerned themselves with 
nonspeech sounds. 

The instrumentalist should be able to profit by a study of a more 
complete phonetic classification than is at present available, since 
many items must creep into a gross acoustic record which are ignored 
by the phonetician, who can easily cut them from his record and 

52 Laughter, sighing, and the like. For fear, anger, and other emotions see 
p. 28, n. in. 

53 Sapir, Language, 46 n. “ When we shout or grunt or otherwise allow our 
voices to take care of themselves as we are likely to do when alone in the country 
on a fine spring day, we are no longer fixing vocal adjustments by voluntary 
control. Under these circumstances we are almost certain to hit oft speech sounds 
that we could never learn to control in actual speech.” 

64 For books and articles see Bibliography. 



NONSPEECH SOUNDS 41 

hearing — or who, in fact, may hear them only with considerable 
difficulty and training — but which the acoustician would find harder 
to ignore since there is within the mechanical record no perceptual 
aid to label certain factors as “ unimportant.” 

The value to the practical phonetician of the study of nonspeech 
sounds, apart from its specific contribution to the investigation of 
the demarcation of speech and nonspeech types, is similar to that of 
the study of marginal sounds: a broader classification is provided for 
pedagogy, and for the elimination of phonemic influence in phonetics, 
while a surer foundation is provided for researches into the differing 
functions of various parts of the mechanism, for the postulation of 
nonphonemic segmentation of continuums, and so on. 



CHAPTER III 


UNITS OF SOUND 

S PEECH, as phoneticians well agree, consists of continuous streams 
of sound within breath groups; 1 neither sounds nor words are 
separated consistently from one another by pauses, but have to be 
abstracted from the continuum. Phonemicists concur in the belief 
that some unit of speech, the phoneme, can be discovered as the basic 
constituent of a linguistic system. Their agreement does not extend 
to the exact definition of such a unit, 2 but that is not pertinent to 
this study; no new definition is attempted. 

A different problem is that which is here answered in the affirma- 
tive: Is there a significant halfway point between the continuum 
and the phoneme? Is there a real, nonfictitious segment of sound 
which is not a phonemic one? 

Many writers actually work with a phonetic unit under the name 
“ speech sound ” or “ phone.” I have found no attempt to show the 
difference between a nonspeech sound and a speech sound as the start- 
ing point for such postulations of speech sound. This has already been 
discussed in Chapter II (pp. 35-38) and needs no further mention 
here. 

Delineation of speech sounds has centered about two factors. The 
first of these represents an attempt to differentiate speech sounds 
from phonemes. 3 

Jones uses the term “ speech sound ” to mean one particular vari- 
ant 4 of a phoneme that has “ definite organic formation and definite 

1 See Bloomfield, Language, 76 ; Br 0 ndal, “ Sound and Phoneme,” Proceed- 
ings oj the Second International Congress of Phonetic Sciences (1936), 43 ; Jones, 
An Outline of English Phonetics *, §§41, 1002-1006; Kenyon, American Pro- 
nunciation, 31-32 ; Noel-Armfield, General Phonetics 4 , 57-58; Passy, The Sounds 
of the French Language 3 , 23-28; Sweet, A Primer of Phonetics 3 , 45. 

2 For a grouping of many definitions see Twaddell, On Defining the Phoneme 
( Language Monographs, Vol. 16). Twaddell himself (33) considers the phonemic 
unit to be fictitious. 

3 Jones, § 189. 4 Jones, §§ 193-99. 
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acoustic quality which is incapable of variation.” 5 Thus the initial 
sounds of ‘ keep,’ ‘ cool,’ ‘ call,’ are “ three distinct sounds.” 6 

Noel-Armfield at first glance appears to make a similar statement: 
“ A speech-sound is an individual variety, whereas a phoneme is a 
family of individual varieties, which, as it were, cluster around a com- 
mon centre, and is the name given to a number of sounds which are, 
from a scientific point of view, varieties of the same speech-sound, 
though these sounds are so alike acoustically that when in combina- 
tion with other sounds the normal ear fails to detect any difference.” 7 
He illustrates by pointing out the difference between the initial sounds 
of 1 kid ’ and ‘ cad,’ and so on. But what is the basic unit of which 
he makes the speech sound an individual variety? He does not say, 
unless by “ scientific point of view ” he means a phonemic one, in 
which case “ speech-sound ” for him is identical with “ phoneme ” and 
varieties of each are to be equated. 

The succeeding statement, “ Different varieties of the same speech- 
sound may belong to one phoneme in one language, and in another to 
two phonemes,” would substantiate such an interpretation, as would 
a statement elsewhere that the term “ glide ” may be applied, not 
only to a “ necessary intermediate sound between two regular speech- 
sounds, but also to any sound leading up to or following any speech- 
sound represented by the ordinary spelling [sic ] .” 8 

Kenyon uses “ speech sound ” as equivalent to “ phoneme it 
is significant for identifying or distinguishing meanings; 0 it may be 
applied to a group of sounds varying because of phonetic surround- 
ings; 10 the sound [h], even though occurring in a dialect, is not a 
speech sound if not distinctive . 11 

Sounds for Kenyon are neither phonemes nor speech sounds if 
they are not distinctive. This applies to the aspiration of stops in 
English 12 (but not to languages in which aspirated and unaspirated 
stops are phonemically different 13 ) and to various types of nonsignifi- 
cant transitions 14 (but not to phonemic glides 15 ); a speech sound 

5 Jones, § 193. 11 Kenyon, 141. 

6 Jones, § 193. 12 Kenyon, 32, 50, 139. 

7 Noel-Armfield, 30. 13 Kenyon, 32-33. 

8 Noel-Armfield, 61. 14 Kenyon, 32. 

9 Kenyon, 32. 15 Cf. Kenyon, S9> 152-61, 233. 

10 Kenyon, 33. 
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may be identical with a nonspeech sound which occurs in a different 
position in the same language 10 or in different languages . 17 

Kenyon also mentions the possibility of a different definition of 
“ speech sound ” from that which he regularly employs, in the sense 
of one definite position or movement of the speech organs. “ In this 
sense, the [t] sounds in till, still, and outdo are different speech 
sounds.” 13 This represents the start of an attempt to establish a 
unit between the continuum and the phoneme, but Kenyon does not 
develop it further. 

A second attempt at the delineation of speech sounds (in addition 
to contrasting them with phonemes) consists in the effort to cut sounds 
from their continuums, to mark their borders, to isolate them from 
other sounds. This endeavor has brought into play more phonemic 
assumptions than the first one, and at present there seems to be 
available no consistent or useful manner of cutting the continuum 
apart from a linguistic (phonemic) division. 

The definition of “ speech sound ” to mean variants of phonemes 
in particular contexts gives no aid whatsoever to this project, since 
the borders of such units are still phonemic, not phonetic, ones: the 
points at which the continuum is cut under such a procedure are those 
determined by the linguistic system, not by any innate phonetic 
features of the continuum. 

Any phonetic (rather than phonemic) system of dividing a con- 
tinuum into unit components must be just as applicable to a series 
of nonsense syllables as to some particular language. Any semantic, 
phonemic, linguistically distinctive criteria applied immediately make 
the unit invalid for analysis of phonetic structure per se. 

Beach 19 states the problem precisely, then immediately slips into 
the use of phonemic criteria for an invalid solution: “ The primary 
unit of phonetics is the speech-sound or phone . . . the difficulty [of 
describing that unit] lies in prescribing exactly what amounts of the 
chain of speech shall constitute single phones.” “ No definition has 
yet been devised which could be equally well applied to all languages. 
In English it is convenient to think of [nd] in endeavor as two phones 

16 Kenyon, 33. 

17 Kenyon, 84-83. 

18 Kenyon, 33. Compare the discussion above (pp. 42-43) of Jones’s defini- 
tion of speech sound. 

19 Beach, The Phonetics of the Hottentot Language, 29-30. 
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[n] plus [d] ; in Zulu and Xhosa it is convenient to consider [nd] 
in indoda as a single phone.” One readily sees that the criterion of 
“ convenience ” here is purely the phonemic status of the sounds con- 
cerned. 

Some investigators state or imply that no consistent unit of sound 
can be found, other than the phoneme (and perhaps the syllable), 
which divides a continuum. Rather than that, the possible divisions 
are said to be infinite, with no logical stopping place between the 
phoneme and the continuum. Although Stetson asserts that a con- 
tinuum can be divided phonetically into syllables by articulatory (and 
resultant acoustic) phenomena, he denies that a continuum can be 
divided into sounds - — they “ have no independent existence in 
speech.” 20 Brpndal states, “ A sound can be viewed as built up of 
any desired number of successive parts, whereas a phoneme is in- 
divisible from the standpoint of a given language.” 21 Bloomfield 
says that a speech utterance “ can be viewed as consisting of any 
desired number of successive parts.” 22 Other phoneticians maintain 
that a glide is composed of “ infinitesimal intermediate positions,” 23 
even though these may be inaudible. 

Some of these statements must be qualified in view of the neces- 
sity of recognition of perceptual units. If one kept cutting a con- 
tinuum, one would run to the absurdity of trying to analyze one half 
of a sound wave. Only the instrumentalist can cut things so fine. 
The person dealing with vocal sounds as they register on the ear must 
limit himself to sounds within the threshold of perception of normal 
individuals. Anything too fast or consisting of too minute a change 
to be perceived cannot, by its very nature, be a phonetic unit from 
an auditory or speech standpoint . 24 This limits the number of units 
in any given sequence a very great deal. Only instrumental re- 
searches can decide exactly how much. 

Even the best-trained phonetician, with the widest phonemic back- 
ground, can hear but a limited number of sounds in a sequence. 

20 Stetson, Motor Phonetics (Arch. Neer. Phon. Exper., Vol. 3), 33. See 
below, n. 63. 

21 Brpndal, 43. 

22 Bloomfield, 76. 

23 Sweet, 52. See also Passy, 88. 

24 Voiceless stops do not enter this category since their central phenomena 
are slow enough to register as lack of sound vibrations. 
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If different phoneticians, who have had different phonemic back- 
grounds, persist in hearing almost the identical number of “ sounds,” 
might it not be indicative that they are approaching the perceptual 
limit for rapid speech? If they are approaching such a limit, and 
find near agreement, may they not be approaching the recording of 
phonetic segments which are ultimately basic to phonemic units and 
which limit the possibilities of those units? 

Several items contribute to the conclusion that there must ulti- 
mately be some such phonetic segmentation behind speech: The 
“ very accurate ” 25 transcriptions of phonetic experts differ but 
little from each other, or even from those of persons with slight 
phonetic training, when it comes to the total number of segments 
which they record for a given sequence. The differences occur pri- 
marily at what might be called fluctuant segments (or fluctuant s) in 
phonemic-phonetic unit division: the fricative of homorganic affri- 
cates, “ nasalized ” stops, aspiration of various kinds, the glottal stop, 
and glides from one vocalic position to another. These fluctuants 
are difficult to hear in proportion to their grouping into single pho- 
nemes, or their lack of phonemicity, in the language of the observer. 
By far the largest number of disagreements of hearing and recording 
which the phoneticians would have among themselves lies in the realm 
of minute articulatory differences — points of articulation, degrees 
of articulation, and the like. These have practically no bearing on 
determining the number of segments which compose a sequence, but 
only upon the identification of one segment with another which is not 
contiguous to it, or its minute articulatory classification. 

Alphabets reflect the same situation. They are designed to record 
approximately equal numbers of segments in a sequence, but add 
different types of diacritics for minutiae of an articulatory order. 
Bloomfield, commenting on the development of phonetic alphabets, 
remarks that their inventors meant their alphabets to be able to record 
every acoustic variety of sound in language, an aim which only a 
mechanical record could fulfill, and adds, “ in practice, the phonemic 
principle somehow slipped in: usually one wrote a symbol for each 
phoneme, but these symbols were highly differentiated and cluttered 
up with diacritical marks, for the purpose of indicating ‘ exact ’ 
acoustic values.” 26 It appears extremely significant that a sharp 

25 Cf. Jones, § 568. 26 Bloomfield, 87. 
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difference existed between the number of symbols in the sequence 
(even if by way of near-phonemic writing), and their complicated 
modification. Again, this points toward a basic and real division of 
any continuum into a specific number of segments; the description 
of these segments is what receives sharply varied representation. 

One more factor contributes to the conviction that there must 
exist a phonetic segmentation more universal and fixed than the 
phonemic one: If a continuum represents a mere flow of sound with 
no points at which segmentation naturally occurs (according to 
BrpndaPs statement [see p. 45] that it may be viewed as built up 
of any desired number of parts), how could it possibly have happened 
by chance that phonemic systems the world over cut such a continuum 
in almost identical places, except where the fluctuants are concerned? 
Even the very points where languages in their phonemic systems differ 
in establishing their phonemic borders, and the narrow range of sound 
types within which these differences occur, emphasize the fact that 
a universal segmentation lies behind vocal expression, and that in a 
certain few places this natural segmentation is not sufficiently sharp- 
cut to force all phonemic systems to make quite the same choice at 
those spots. 

It follows, also, that a phonetic segmentation can be carried on 
with utility, if some consistent method can be found for classifying 
the fluctuants. It is precisely at these points that phoneticians allow 
phonemic influence to prevent such a classification, as will be seen 
in a moment. 

Instrumentalists can in general cut sounds out of sequences, but 
Haden says they have difficulty in doing so unless there is a regular 
alternation of voiced and voiceless sounds. 27 De Groot remarks, 
“ There can be little doubt that the current textbooks of phonetics 
scarcely trouble about giving a definition or a characterization of a 
speech-sound that might offer a starting point for experimental re- 
searches. In fact, I did not find an adequate definition in any one 
of them.” 28 Since, as Bloomfield states, laboratory phoneticians 

27 Haden, The Physiology of French Consonant Changes (Supplement to 
Language, Dissertation 26) , 16. See, however, Parmenter and Trevino, “ The 
Length of the Sounds of a Middle Westerner,” Am. Speech, 10 (1935), 128. 

28 De Groot, “ Instrumental Phonetics. Its Value for Linguists,” K. 
Akademie van Wetenschappen, Afdeelingen Letterkunde, Mededeelingen, 65, A. 
2 (1928), 48. 
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usually formulate their problems in terms “ borrowed from practical 
phonetics,” 20 any clarification of the fluctuants should be of consider- 
able aid. To be of most value to the instrumentalist such a definition 
must distinguish very carefully between the data of the sound waves 
and the perceptual elements registered by the ear. 

Numerous investigators agree that speech sounds are made with 
the speech organs in positions which are fixed or practically fixed. 
Jones speaks of a “ speech-sound proper ” 30 as one of definite organic 
formation; he contrasts vowels with diphthongs, the former being 
produced with “ approximately stationary ” organs of speech, the 
latter with a “ clearly perceptible movement.” 31 Westermann and 
Ward state that a phone results from “ one position of the organs 
of speech.” 32 Sweet holds that “ Analysis regards each sound as a 
fixed stationary point . . . analysis concerns itself only with the 
middle of the fully developed sound,” 33 while flap consonants “ are 
pure glides, organically as well as acoustically, there being absolutely 
no fixed point in their formation.” 34 Kenyon says that in the phrase, 
‘ the most of the time,’ there are thirteen speech sounds and for each 
“ the speech organs are momentarily in a definite position ” ; 35 since 
[u] implies fixed position of lips and tongue “ for the duration of the 
vowel,” 36 [w] cannot be [u] plus a vowel. 

Perhaps fluid positions rather than fixed positions should be taken 
as the basis for a satisfactory theory of phonetic segments, since a 
fixed position would appear to be a departure from reality. Never- 
theless, there may actually prove to be a sharp basic perceptual dif- 
ference between sounds usually classified as fixed and certain ones 
(e.g. vowel glides) of the fluctuant type. Such a division may ulti- 
mately explain why fluctuant types are combined differently from the 
“ fixed ” types in various phonemic systems. 

Jones 37 and Kenyon 38 both mention two kinds of sounds in 
speech: speech sounds (fixed, the phonemes) and glides (moving, the 
transitions). Both authors list as exceptions to their groupings the 

29 Bloomfield, 85. 30 Jones, § 190. 31 Jones, § 240. 

32 Westermann and Ward, Practical Phonetics for Students of African Lan- 
guages, 28. 

33 Sweet, 44. 36 Kenyon, 152. 

34 Sweet, 64. 37 Jones, §§2-6, 102, 219. 

35 Kenyon, 32. 38 Kenyon, 31-32, 33, 152-61. 
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vowel glides which make the “ consonants ” [w] and [j], and some 
of the diphthongs. These exceptions correspond directly to their 
phonemic interpretations of English, and nothing they advance coun- 
terbalances the strong implication that phonemics, not phonetic 
structure, has caused this change of classification. Though Jones 
speaks of most glides as “ inevitable ” and as being formed along 
“ the most direct route ” between speech sounds , 39 and contrasts 
them with “ independent ” vowel glides which are “ expressly made ” 
(rather than being the inevitable medium between other sounds), 
such a division is not convincing in view of the fact that the vowel 
glides “ start in the position of one vowel and move in the direction 
of another 40 the audibility of the glide will not explain its inde- 
pendence, since, though “ most glides are inaudible,” at least some 
are “ always clearly audible.” 41 

If one cannot have consistent criteria for determining whether or 
not a glide is a separate sound, or part of a preceding or a following 
one, there remains no chance for consistent segmentation. If pho- 
nemic criteria are used, the division is valid only for phonemic pur- 
poses. 

Passy says that “ Glides are accessory sounds produced involun- 
tarily in the articulation of given speech sounds.” 42 “ Involuntarily ” 
with him obviously does not mean “ inevitably,” since he discusses 
errors of certain foreigners who fail to pronounce some of the glides 
properly . 43 The implication, then, is simply that by glides he means 
nondistinctive parts of some phonemic unit. Noel-Armfield conveys 
a similar implication when he says that, strictly, the term should “ be 
used to express nothing more than a necessary intermediate sound 
between two regular speech-sounds [i.e. phonemes].” 44 

Sweet says that synthesis looks at a sound “ as a momentary point 
in a stream of incessant change,” 45 but admits his inability to mark 
off the segments in this stream by admitting that for certain types “ it 
is often difficult to know whether to write the glide or not.” 46 Fur- 
thermore, when he starts dealing with sounds in speech he classifies 
practically everything as a glide: “ Most consonants, as compared 
with vowels, have more or less the character of glides 47 flap con- 

39 Jones, §§3, 5. 42 Passy, 89. 45 Sweet, 44. 

40 Jones, § 219. 43 Ibid. 46 Sweet, 53. 

41 Jones, § 6 and n. 44 Noel-Armfield, 61. 47 Sweet, 64. 
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sonants are “ pure glides ” 48 as we have said ; voiceless stops are, 
“ acoustically speaking, pure glide-sounds ” ; 49 the aspiration of stops 
is a glide . 50 

If this aspiration glide is strong .enough, it is said to become an 
“ independent element.” 51 Passy 52 likewise considers a reinforced 
glide to be an independent sound, and Jones speaks of such a sound 
as “ practically a full independent [h] ” 53 (this implies that when 
Jones previously stated of less strong aspiration that it was inde- 
pendent , 54 he meant something less than a separate segment). At 
this point phoneticians face a difficult problem if they wish to define 
a stop (or plosive consonant; Jones uses “stop” only for the 
closure 55 ) as having to have three parts to be “ complete.” 56 When 
one of the factors is lacking, or one element is “ independent,” con- 
siderable adjustment has to be made in the segmentation. 

Twaddell has a statement about a “ phonetic fraction ” which 
relates to the segments: “ Corresponding to phonologically different 
forms are significantly different phonetic events. Each of these pho- 
netic events is composed of phonetic fractions. These fractions are 
usually reckoned in terms of intervals during which the articulatory 
organs are substantially at rest, or are leaving such a position ; inter- 
vals during which some articulatory organs are making a unidirec- 
tional movement; or intervals during which some articulatory organs 
are moving toward a position, maintain it substantially, and leave it. 
These fractions are, in short, what we call ‘ speech-sounds,’ i.e. time- 
intervals of the actual utterance-events.” 57 The principal difficulty 
with this as a working method for cutting out phonetic fractions is 
found in the overlapping criteria. How is one to tell whether a phe- 
nomenon wherein the organs move toward a position, maintain it, 
and leave it, composes one segment functioning as a whole; or one 
segment functioning as a fixed position; or three segments, with 
approaching, fixed, and releasing positions, and so on? 

48 Ibid. 51 Sweet, 58-59. 54 Jones, § 563. 

49 Sweet, 56. 52 Passy, 92. 55 Jones, § 562. 

50 Sweet, 56-57. 53 Jones, § 568. 

56 Jones, §561; Noel-Armfield, 64; Passy, 71-72 (cf. 74, n. 1: “But of 
course for a complete [m] the mouth must be closed at the beginning and open 
at the end ”) ; Ripman, Elements of Phonetics, 85 (in which the “ momentary 
pause” can be left “ out of account”!). 

57 Twaddell, 41-42. 



UNITS OF SOUND 


5i 


If sounds with either moving or fixed positions can both be seg- 
ments, some method must be devised to tell which criterion should be 
applied in particular situations. In lieu of that, phonemic structure 
rather than phonetic nature is certain to swing the balance one way 
or the other, as it did in the cases reviewed above. 

Phonetic terminology and classification charts have been influenced 
by phonemics in regard to certain fluctuant segments, especially in 
the naming of certain groups as if they were single phonetic segments. 
Groups such as [tj] and [ts] have been called affricates; [nd], 
nasalized stops; [kw], labiovelars; [ph], aspirates; [ai], diphthongs. 
Why are there no unit names for groups such as [af], [fl], [rt], 
[to], and the like? Phonetically there is no explanation for this dif- 
ference, since [gu] entails less change of articulation than [gw], [ba], 
the same as [ph], and so on. The explanation is perforce phonemic: 
in certain sound groups (at the fluctuant points) two segments fre- 
quently combine into single phonemes; as such they have acquired 
names. There is no objection to continuing such convenient terms 
or adding more, but they should be redefined strictly from a phonetic 
point of view. Bloch and Trager 58 are among the few writers who 
make this essential distinction by eliminating such groups from their 
phonetic classifications, but who provide at the same time convenient 
symbols for the use of the phonemicist. 

Many short descriptions, and even some longer ones, of the pho- 
netics of languages lack any attempt to define a speech sound . 59 In 
general, these are the same accounts which contain no discussion of 
nonspeech or marginal sounds. 

As regards a continuum, the study in hand has two primary 
analytical aims: The first of these is to provide a means for cutting 
a continuum at any point, regardless of its relationship to other sounds 
or its natural segmentation, and to provide a terminology which will 
allow for the articulatory description of that sound in regard to cer- 
tain specific major features of its production. This description makes 
no pretense at completeness, often ignoring finesse in fixing point of 
articulation and the like; the choice of factors for description is based 

58 Bloch and Trager, Tables for a System of Phonetic Description (Pre- 
liminary Ed.), 3. 

59 See Gairdner, The Phonetics of Arabic; Jones and Woo, A Cantonese 
Phonetic Reader; Karlgren, A Mandarin Phonetic Reader ( Archives d’ Etudes 
Orientates, Vol. 13) ; Stirling, The Pronunciation of Spanish. 
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primarily on strictural function (see pp. 56-65, 129-36, and 151-56). 
Such a piece of the continuum must include enough of the whole to 
form a perceptible impression on the ear. This is our phonetic frac- 
tion (see p. 1 16). 

The second basic purpose with regard to the continuum is to pro- 
vide a workable method for the delineation of natural phonetic seg- 
mentation. This has nothing to do with the articulatory description 
of the production of any particular sound except in so far as it be- 
comes essential for the determining of the center or borders of a 
specific segmental unit. This segmental unit is to be determined 
entirely apart from phonemic function; if the influence of the latter 
creeps in unawares, as it almost certainly will at times because of 
phonemic prejudice from one’s native language, to that extent the 
effort has failed. A corollary of this aim states that such a segmenta- 
tion procedure is equally applicable to any and all languages, or to 
any stream of nonsense syllables. In so far as it may prove upon 
further testing to have come short of this goal, it must be revised. 

The basic difficulty involved in segmentation (a difficulty due to 
individual variation in perceptual ability because of physiological 
limitations, phonemic background, or phonetic training) is taken care 
of in the procedure. The strategic fluctuants reach segmentation by 
the very same procedure as do all other sounds ; the same criteria, for 
example, apply to [w], [p], [s], and [u] . One basic limitation re- 
mains to the segmentation technic here set forth : although it identifies 
the number of segments in a sequence (subject to the perceptual factor 
already mentioned), and points out the center of such segments, the 
borders cannot be strictly delineated. One cannot state the exact 
point where the division comes between segments, even though the 
centers are identifiable. 

Once segmentation is complete, then segments of perceptually 
similar or identical sounds are grouped into phonetic units, which 
correspond directly with what in general usage is known as the 
“ phone ” except that phonetic units are more rigorously controlled 
as to size. The features selected for description can be consistently 
chosen on the basis of strictural function (rather than by phonemic 
assumptions or inconsistent factors), with the reservation that many 
smaller details — the exact point or degree of articulation, for ex- 
ample — which may be phonemically distinctive when the segment 
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appears in speech, will inevitably be omitted from the phonetic de- 
scription thus given. When the phonemicist points out to the phoneti- 
cian certain other features in any phonetic unit which he wishes 
described, these can be added at will, within the limits of the articu- 
latory procedure. 

Since the number of segments in a particular continuum ap- 
proaches the number of phonemes in that continuum, and the differ- 
ences occur largely at well-known fluctuant points, an impression- 
istic phonetic record of a new language proves theoretically legitimate 
as well as practically valuable as a tool for the phonemicist in his 
analysis of the phonemic structure of that language. At the same 
time that it makes available a less prejudiced classification for the 
description of phonemes by an articulatory technic, it lays a better 
foundation for a study of the exact kinds of segments which can 
possibly be found to combine into single phonemes; Trubetzkoy pre- 
sents material related to this possibility . 60 A segmental analysis 
should also give further validity to a nonphonemic study of speech 
such as the dialectologist produces . 61 

In this work a sound abstracted from speech, of indefinite extent 
but composed of no less than one segment and of no more segments 
than are known to be joined into single phonemes in some language, 
is a speech sound. The term is in contrast to related terms which 
have already been introduced (“ phonetic fraction,” “ segment,” 
“ phonetic unit ”). 

By using the same criteria that were developed for the definition 
of the phonetic segment but applying them in a somewhat different 
manner, conditioned by the differing function of the moving parts of 
the vocal mechanism, we can arrive at an articulatory delineation of 
the syllable. This is quite similar to the description of the syllable 
given by Stetson in terms of chest pulses , 62 with three principal excep- 
tions: (r) Stetson failed to allow for a perceptual factor and was 

60 Trubetzkoy, Anleitung zu phonologischen Beschreibungen, 13, (Edition 
du Cercle linguistique de Prague, Brno, 193s). 

61 For the necessity and validity of the nonphonemic approach in this con- 
nection see Bloch, Review of “ Phonetic Transcriptions from ‘ American Speech ’ ” 
(edited by J. D. Zimmerman), Lang., 16 (1940), 174-75; Kenyon, 36-37 (for 
both dialect geography and historical phonetic change) . 

62 Stetson, 36: “The syllable is one in the sense that it consists essentially 
of a single chest pulse, usually made audible by a vowel, which may be started or 
stopped by a chest movement.” 
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therefore forced to describe some groups of three sounds 63 (two non- 
syllabic consonants and a vowel) as containing two syllables, when 
the ear does not hear them so; his recordings of the chest pulses 
included some that were too weak to register audibly. He says, for 
example, of pfeil, and like words, that the second consonant some- 
times becomes a “ preliminary silent syllable followed by a voiced 
syllable.” 64 (2) He was confused by the spelling when he tried to 
describe consonant clusters. He says of certain of them, “ languages 
differ in their handling of many of these groups. The German spells 
a group ‘ tsch,’ which the French spells ‘ tch,’ and the English 1 ch.’ ” 65 
(3) He shows that another criterion than that of chest pulses may 
have led him to his conclusion when he grants the possibility of sound 
groups such as [aia] or [ala] being made with single chest pulses but 
constituting two syllables; 66 in this case the oral movement would 
divide the syllables. He has no experimental evidence of the existence 
of this kind of syllable. Even if series of sounds of this type can be 
made with single chest pulses (and probably they can be, on the per- 
ceptual level at least), it appears preferable not to introduce a new 
criterion but to use one only — one based uniformly on strictural 
function (see below, p. 116). Introduction of a second type of articu- 
latory criterion would probably undermine the consistency with which 
the first could be applied. 

Other criteria which have been advanced for the syllable, such as 
relative loudness of the phonemes, 07 sonority, 03 prominence (made 
up of inherent sonority, length, stress, special intonation, or a com- 
bination of some of these), 09 and change in stress or pitch, 70 are not 
incompatible with an articulatory definition; these acoustic phe- 
nomena are the results of some articulatory movement, and might 
frequently be more convenient to use for description, if the two 
factors can be equated. 

Some type of natural segmentation of sound sequences must be 
discoverable when even those who deny its existence use symbols 

63 Stetson (33) actually worked with sounds or segments of sound, and 
used symbols to represent different segments, even though he denied emphatically 
that they exist. 

04 Stetson, 124-25. 

65 Stetson, 121. 68 See Kenyon, 68-69. 

66 Stetson, 58-59. 69 See Jones, §§209, 211, 215, 218. 

67 See Bloomfield, 125. 70 See Noel-Armfield, 51-52. 
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which assume this division. The smallest segmentation is into sounds. 
A larger segmentation which is related to it, but which functions dif- 
ferently in the continuum, is that of the syllable. The next chapter 
will attempt to show that within single sounds the productive move- 
ments do not all have the same function. 
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STRICTURAL FUNCTION 

I N THE production of sounds not all parts of the vocal mechanism 
act alike: neither all complete closures nor all partial ones have 
the same place in sound production, nor do the oral, nasal, and pharyn- 
geal cavities. It is this difference in the action of various parts of the 
mechanism within the production of single sounds which is strictural 
junction. 

This type of internal function of parts of sound mechanisms con- 
trasts sharply with an external type, contextual junction, where sounds 
as complete units are related to one another in continuums as mem- 
bers of syllables or parts of a phonemic system. 

Nowhere have I found any systematic discussion of such differ- 
ences of internal function. In many respects the differences are ig- 
nored, and strictures serving dissimilar functions in a sound are treated 
on a level. Jespersen’s analphabetic system, 1 for example, seems to 
give them all equal prominence without sufficient attempt at strictural 
differentiation. 

Everywhere, however, functional differences are assumed. They 
enter into the warp and woof of every phonetic classification. In fact, 
no very significant articulatory phonetic (not phonemic) classifica- 
tion could possibly be made without some of these assumptions. In 
the next few pages I shall attempt to prove that such factors are 
present in sound production ; to show that phonetic literature assumes 
or hints at many of them; and to demonstrate the need for a systematic 
presentation of such elements. Later a classification will be at- 
tempted (pp. 129-36), and a convenient descriptive order for these 
functional ranks postulated (pp. 151-56). It is this descriptive order 
which can give us a rational choice of basic items in describing a 
phonetic unit apart from phonemics. 

1 Jespersen, The Articulations of Speech Sounds Represented by Means of 
Analphabetic Symbols. 

56 
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In sounds in which two mechanisms are operating at the same 
time (a combination of productive types; for details see pp. 94-103). 
a single stricture often has two different but simultaneous functions. 
The velar closure of a nasalized click offers an example. For the 
click part it is moving backward, drawing air into the mouth ; of itself 
it produces no sound — the release of a closure farther forward does 
that (e.g. the release of the lips for a kiss). For the voiced nasaliza- 
tion it serves rather as a closed valve which prevents air escaping 
from the mouth and shunts the air out the open nasal passage, pre- 
cisely as it does for an ordinary velar nasal which has no superim- 
posed click. If, on the other hand, the nasal passage were closed, 
but the vocal cords were vibrating (for a “ voiced click ”), the velar 
closure would be functioning precisely as for a voiced velar stop [g] , 
while at the same time in its other function it would be moving to 
empower the oral click. 

A contrast of two functional types is mentioned by Westermann 
and Ward: the glottal closure is said to be secondary to a primary 
oral closure in ejectives. 2 Catford remarks that glottalic pressure 
stops (which are the same as Westermann and Ward’s “ ejectives ” 
and the “ glottalized stops ” of this study) involve inevitably two 
releases, of which the outer is primary and the inner secondary ; with 
velaric pressure or suction types the velaric release is secondary to 
primary outer release. 3 

The type thus labeled “ secondary ” has sometimes been called 
“ inner closure.” 4 This term does not clarify its functional peculiar- 
ity. On the one hand, it is not more inward than many other closures 
which do not enter the group (velar closure for [k] is not an “ inner 
closure,” nor is glottal closure for [?] ) ; on the other hand, when two 
closures are present, the innermost is not always an “ inner closure ” 
(as in double stops of [pk] type 5 6 ), and one can actually have a func- 
tional “ inner closure ” which is outward in the mouth (see pp. 
101-2). Function, not absolute or relative position, distinguishes 

2 Westermann and Ward, Practical Phonetics for Students of African Lan- 
guages, 101. 

3 Catford, “ On the Classification of Stop Consonants,” Le Maitre Phone- 
tique, 3d Series, 65 (1939)1 5 - 

4 Bell, Visible Speech (Inaugural Ed.), 62; Catford, op. cit. in Le Maitre 

Phonetique, 3d Series, 65 (1939), 2. 

6 See Noel-Armfield, General Phonetics i , 119; Westermann and Ward, 107. 
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the type — the function of initiating the air stream (not of being a 
“ suction stopper ” 6 ). 

One of the most frequent and apparent differences of function is 
evidenced in the treatment of the opening leading to the nasal cavity. 
For convenience in this discussion the upper part of the soft palate 
facing the nasopharynx is the velic (so called in this discussion to 
distinguish it from the velum, which represents the lower side toward 
the mouth) ; the closure of the nasal passage is therefore a velic closure 
(in contrast to velar closure, when the tongue touches the soft palate) . 

Velic closure receives much less attention than closure of the lips 
or closure by the tongue, and the like. No phonetic treatment which 
I have seen includes in the definition of its technical term for the 
middle sound in [apa] any specific mention of velic closure; all of 
them take specific note of bilabial nature (i.e. “ bilabial voiceless 
stop,” never “ velic bilabial voiceless stop ”). In detailed description 
of stops a few books 7 meticulously include the velic closure ; even 
these do not consistently mention the velic closure when a brief 
reference is made to particular stops after the full description has 
been given. More often the velic closure is assumed or receives a 
brief covering statement . 8 9 This difference in treatment between 
closures at velic and closures at oral points of articulation reflects 
a difference in function. 

The oral closure, not the velic closure, is made to identify a par- 
ticular stop. This applies regardless of whether or not the stop has 
oral approach or release, or both (as in [apa]), or whether it has 
velic approach or release, or both (as in [mbm] ) . If the velic closure 
were on a par with the oral closure, performing the same function, then 
the stops would have to be classified by either approach or release, 
since phonetically there would be no other way of determining whether 
the sound is a velic or an oral stop. Actually, phonemic usage of 
the sounds probably has a very great determining effect on the inter- 
pretation of the stop of [mbm] as a bilabial , 0 rather than a velic. 
This cannot be the entire explanation. An innate functional differ- 

6 Bell, 56. 

7 Jones, An Outline of English Phonetics 4 , §§ 496, 511, 532 ; Kenyon, Ameri- 
can Pronunciation °, 40-44 ; Stirling, The Pronunciation of Spanish, 1 1 . 

8 See, for example, Ripman, Elements of Phonetics, 1. 

9 For nasal and lateral release see Jones, §§499, 505, 313, 523, 535, 546, 
586-90; Kenyon, 50. 
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ence must be reflected; otherwise why is not a velic stop a phoneme 
in any language? Why is it not given a place in phonetic alphabets 
apart from nasal release of an oral sound? Why is it so readily passed 
off as a convenient assumption? Why is it so difficult to teach stu- 
dents to remember the closure or to get a kinesthetic sense 10 of its 
presence? In fact, I have never seen a velic stop listed as a vocal 
sound separate from other stops except in Ripman, where it is called 
“ velar ” or “ faucal.” 11 

With continuants, the assumption of velic function (in contrast 
to explicit oral strictures) is more striking; the marked tendency is 
for authors to give no statement whatsoever of the velic position, 
simply assuming it to be closed, until suddenly in one section they 
note its lowering for nasalized sounds. Jones has three introductory 
chapters on vowels , 12 including a description of their cardinal values, 
in which no mention is made of velic position until, at the very end of 
the chapter, nasal vowels are discussed; 13 in later detailed descrip- 
tions it is mentioned for each vowel . 14 Passy leaves it entirely to as- 
sumption for both consonants and vowels , 15 except for nasals and 
nasalized types . 16 

The opening to the esophagus seems to be completely ignored. Yet 
in sounds with esophageal air (belches) it precisely parallels the glot- 
tis for pulmonic types. Its usual passive function is different from 
a lip closure or a velic closure when all three occur simultaneously in 
[p] , yet it is closer to the latter than to the former. 

There is general agreement among phoneticians 17 in treating cer- 

10 Differences in the sensitivity of the velic and tongue or lips can scarcely 
be the complete explanation, although they are undoubtedly influential in the 
determining of function, when the velic functions with extreme facility in the 
changing of oral to nasal sounds, and so on. Regardless of possible physiological 
explanation of function, the functional differences themselves are the important 
thing to note here. 

11 Ripman, 86. 

12 Jones, §§ 96-163. 

13 Jones, §§ 164-63. 

14 Jones, 63 n., §§ 25s, 268, 277, 283, 298, 306, 316, 324, 333, 344, 395, 409, 
422, 447, 436. 

15 Passy, The Sounds of the French Language 2 , 60-62, 69. 

10 Passy, 62-64, 69. 

17 See Bloch and Trager, Tables for a System of Phonetic Description (Pre- 
liminary Ed.), s; Gairdner, The Phonetics of Arabic, 20-21 ; Jones, § 333; Noel- 
Armfield, 96, 117-18; Sweet, A Primer of Phonetics 3 , 37; Westermann and 
Ward, 107. 
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tain types of strictures as modifications of others; the modifications 
are “ secondary ” (and the others “ primary,” or “ basic ” sounds). 
The list includes labialization, palatalization, velarization, pharyn- 
gealization, and, in occasional accounts, a few others. Bloomfield has 
pointed out that this is not a phonemic distinction, nor one necessarily 
seen reflected in the systems of languages. 18 There seems to be some 
basic division here much deeper than mere convenience, however. 
One could protest against saying “ alveolarly sibilantized labial ” for 
“labialized [s] or “ bilabially stopped palatal” for “palatalized 
[p].” The descriptive order appears to be chosen here on some real 
principle, not from fancy. Although this division is so widespread, 
and given representation in phonetic alphabets, 19 yet I have seen 
no concerted attempt to explain why certain strictures were considered 
basic and other strictures secondary, modifying them. The determin- 
ing factor cannot be the point of articulation, since the lips are primary 
in [p] but secondary in labialized [s] ; nor absolute amount of open- 
ing, since the same amount can be primary in [j] and secondary in 
palatalized [p] . There seems to be no effort to postulate all possible 
types of combination of two such secondary modifications for single 
sounds; Bloch and Trager mention one type, labiovelarization. 20 

Sounds in which two articulations are present, one regarded as 
primary (or basic) and the other secondary (or a modification), are 
seen even more significantly to be analyzed on the basis of some 
functional difference when they are contrasted with other types of 
sounds having two articulations which are not analyzed in this way. 
One of these is any oral sound with velic closure. That closure is 
not considered a “ modification ” (of, for example, the alveolar stric- 
ture in [s] ), nor “ basic ” (which is reserved for the alveolar stricture 
in forming the sibilant). Another instance of two articulations nei- 
ther of which comprises a modification is seen in double stops; Wes- 
termann and Ward point out this difference, in that neither of the 
closures is secondary or primary in respect to the other (in contrast 
to palatalized [p], where lips are primary). 21 In the various types 
of clicks and similar sounds the “ inner closure ” would enter neither 

18 Bloomfield, Language, 109. 

19 See Heepe, Lautzeichen und Hire Anwendung in verschiedenen Sprachge- 
bieten, and, more recently, Bloch and Trager. 

20 Bloch and Trager, 7. 

21 Westermann and Ward, 107. 
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the “ primary ” nor “ secondary ” groups ; compare a labialized click 
[t] , where the lips form a modification, the tongue tip a primary clos- 
ure, and the velar closure is different from either one. 

In spite of the fact that lip rounding of vowels is sometimes said 
to be secondary, 22 this cannot be equated with function of labialization 
of consonants. One can scarcely say that [u] is a “ labialized basic 
back vowel,” since [u] seems to have basic character of its own, fully 
as much as does an unrounded high back vowel; [u] does not sound 
like an unrounded back vowel with something extraneous added, but 
sounds, rather, like a new entity. A functional difference is involved 
in this distinction. 

Oral and nasal cavities are treated in entirely different ways; func- 
tionally the nasal cavity is treated as secondary to the oral one. This 
is not always admitted. Bloch and Trager state that the oral-nasal 
division is considered basic, 23 but later under resonance types, in- 
stead of paralleling oral with nasal, they parallel oral with nasaliza- 
tion; that is, a combination of oral plus nasal, in which the latter 
takes a secondary, modifying, place. 24 

Were oral and nasal cavities to be treated as functionally equal, 
“ oral vowels ” would parallel “ nasal vowels ” (that is, [m] , [n] , not 
nasalized vowels); [a] would parallel [q] (since for [a] the velic 
closes the back of the nasal cavity, whereas for [q] the tongue closes 
the back of the oral cavity) ; [m] would parallel nasalized [a] with 
velic open but nostrils pinched closed (since for [m] the front of the 
oral cavity is closed but the remainder is used for resonance, whereas 
for [a] the velic’s being open would allow the nasal cavity to be used 
for resonance while it was closed at the front end by the nostrils) ; 
[x] would parallel a velic fricative (since [x] has closure at the back 
of the nasal cavity but friction at the back of the oral one, while a 
velic fricative has back oral closure plus back nasal friction) . Further- 
more, if both cavities were wide open, a new type of sound would 
be produced, an “ oral-nasal,” in which neither cavity would pre- 
dominate and neither would be secondary; actually this type of sound 
is called a nasalized vowel — never an “ oralized nasal ” — showing 
that the nasal, cavity is considered secondary. The two parallel cav- 

22 See Gairdner, 34; Noel-Armfield, chart facing 180; cf. also Bloomfield, 105. 

23 Bloch and Trager, 2. 

24 Bloch and Trager, J. 
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ities could be diagrammed in a kind of F form, in which the lower 
cavity can be closed at any point by lips or tongue; but the upper 
can be closed only at the back by the velic (or at the front, by the 
fingers pinching the nostrils). 

To complicate the picture further, Bloch and Trager appear to call 
the nasals “ nasal stops.” 25 Since it is obvious that such sounds are 
not stops, but continuants, one must assume either that the termi- 
nology is chosen because of phonemic usage of the sounds as con- 
sonants, in a way similar to stops, or that the points of articulation 
of the oral closure in each case may be the same. Neither of these 
reasons would lead to such an association of stops and nasals if the 
oral and nasal division were treated as fundamental, and (therefore) 
if free nasal egress of air with oral closure were paralleled with free 
oral egress with nasal closure. 

One sees, then, that resonance of the nasal cavity has been treated 
as secondary to resonance of the oral cavity. Nevertheless, this type 
of modification cannot be equated with the type which is caused simply 
by adding another oral stricture, as Bloch and Trager very definitely 
show in their outline ; 26 that is, modification by nasalization functions 
differently from modification by strictures such as labialization. 

That articulatory movements and sounds (apart from voice) in 
the pharynx have been given little study, and have little place in 
phonemic systems, we have already mentioned . 27 It may be that 
phonemic systems use such items so infrequently because they are in 
some way subordinate as regards strictural function. One can give 
further evidence of the subordinate nature of all sounds below the 
oral cavity, as far as speech is concerned. In speaking, it appears 
that absolutely nothing a person can do, with the exception of cutting 
off sound by glottal or pharyngeal stops, changes the sequence of 
consonant and vowel phonemic types. Why does a vowel retain sim- 
ilar basic characteristics when whispered (some writers urge whisper- 
ing of sounds for analysis since their character is then more clearly 
recognized 28 ) as when voiceless or voiced(?) murmured(?) in fal- 
setto ( ? ) with pharyngeal or faucal constriction (? ) Presumably it 

25 The nasal stops (“ nasalized ” is also implied) mentioned in (2) are appar- 
ently to be identified with the nasals on chart 5 (see also Ripman, 13). 

26 Bloch and Trager, 7. 

27 See p. 8. 

28 See Gairdner, 32; Sweet, 23. 
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indicates a difference in function between oral strictures and those of 
the pharyngeal cavity. Even those sounds which are phonemic norms, 
produced in the pharynx, tend to be misunderstood (see Sweet and 
Passy for pharyngeal spirants labeled bronchial ones 29 ). Voicing is 
treated entirely differently from any other modification, with a func- 
tion distinct from that of labialization and the like. With but few 
exceptions, 30 voicing is classified as causing one of the basic consonant 
divisions, 31 but, in contrast, not as causing vowel divisions. 32 

Whereas contact, or even narrowing, at the center of an air pas- 
sage is always considered important (e.g. for making a fricative con- 
sonant closure, or producing contact for a lateral), the very same, or 
even far greater, contact at the side of the air passage (e.g. for [i] 
or [s] or rounded lips) goes unclassified or even unnoted except that 
such contact is used with palatograms 33 for determining by implica- 
tion the size and height of the opening, and so on; the contact itself 
is either ignored as being unimportant, or else it is tacitly assumed that 
such contact naturally follows if one describes a certain opening. 
Nevertheless, the contrast of side contact with central contact, and 
the contrast of contact which partly closes an opening with contact 
which completely closes an opening, represent marked functional dif- 
ferences which should be classified more expressly. If Kenyon uses 
contact at the center of the tongue (as distinct from a side opening) to 
prove [1] a consonant, 34 why has not the same argument been used 
with equal legitimacy to prove that the side contact on [i], which 
produces the narrowing he mentions, 35 makes it also a consonant (in- 
stead of an appeal being made to resonance criteria to show this) ? 
Only tacitly assumed functional differences in the types of contact 
can explain such a divergence of treatment. 

Should all assumptions of differences of strictural function be elim- 
inated from phonetic discussion, articulatory classification would be 

29 See p. 8, n. 41, for references. 

30 Sweet’s classification of voiced consonants is obscure because he relates 
them to vowels (31). 

31 See Jones, §173; Kenyon, 46; Noel-Armfield, 28; Passy, 68; Sweet, 
3i. 4i. 

32 Cf. p. 5, n. 1 ; also Chapter V, “ Classification Criteria,” p. 72. 

33 See Jones, §§ 79-81, figs. 37, 39, and so on; Kenyon, 60; Noel-Armfield, 
142-47. 

34 Kenyon, 58-59. 

85 Kenyon, 59. 
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vastly different and drastically hampered. Two of the criteria remain- 
ing would be the degree of closure and the number of strictures of each 
such degree. If, for example, two degrees of closure were used, com- 
plete closure and partial closure, a study by number of strictures only 
would fail to separate the following obviously different types: 

Nasals and [a] and glottal stop; all have one complete closure 
(velic, oral, glottal, respectively) ; 

Egressive-click fricatives and oral pulmonic fricatives; each has 
one complete closure (velar, velic) and one partial one; 

Pulmonic stops (except [ ? ]) and (oral) click stops; both have 
two closures (velic and oral, velar and front oral) ; in a similar way, 
modified pulmonic stops (modified by labialization, and so on) and 
modified click stops could not be separated (each has a single partial 
stricture added to its basic type) ; 

Glottalized stops and double pulmonic stops ; both have three full 
closures (glottal, velic, front oral; velic and two oral) ; 

Compressive and rarefaction types of stops of fricatives, in any 
one mechanism (since the number of strictures, degrees, and places 
of closure are all the same). 

Even in this list at least one functional assumption crept in, the 
ignoring of esophageal closure. Were this factor included, it would 
change the grouping and obscure relationships further by adding an- 
other closure to pulmonic and glottalized types, but not to glottal 
stop or to (oral) clicks. The samples were chosen also to prevent 
the entrance of the complicating factor of side contact in vowels and 
the like. Nonspeech sounds would show other parallels in addition 
to these, but the basis has not yet been laid for discussing them. 

An articulatory classification to be workable must postulate dif- 
ferences of function or rank in various types of closures, narrowings, 
and partial contacts, as well as functional differences between the 
cavities themselves. 

Once different ranks are classified, the description of any phonetic 
unit may contain items from each of certain ranks, if they cause 
changes of sound above the perceptual threshold for the particular 
observer. In this way, after the phonetician has isolated his own 
units without phonemic aid, he can also give a rough articulatory de- 
scription of them without particular phonemic admixture. The de- 
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tails that he cannot describe he must leave subsumed under these 
rough articulatory labels as a partial superimposing of imitation-label 
material. If he is capable of describing certain differences which do 
not occur within his chosen descriptive patterns, he must wait for the 
phonemicist (or some phonetic project such as a linguistic atlas) to 
call attention to the need for a discussion of them. This to some 
extent would relieve the phonetician of the onus of describing without 
plan anything and everything he can hear, by setting up a definite 
program for the description of certain basic phonomena of strictural 
function. 



CHAPTER V 


CLASSIFICATION CRITERIA 

T HE most basic , 1 characteristic, and universal division made in 
phonetic classification is that of consonant and vowel. Its de- 
lineation is one of the least satisfactory — a difficulty passed on to 
the instrumentalist . 2 Frequently for descriptions of single languages 
the division is assumed, with no attempt to define it . 3 The distinction 
is often presented as if it were clear-cut, with every sound belonging 
to one or the other of the groups. Jones, for example, says, “ Every 
speech-sound belongs to one or other of the two main classes known 
as Vowels and Consonants.” 4 Later, however, various sounds are 
mentioned by him which have to be discussed separately under dif- 
ferent rules, or with various kinds of reservation, because they do 
not neatly catalog themselves . 5 Occasionally, in contrast to this, a 
writer frankly admits that his definition either of vowels or of con- 
sonants is unsatisfactory . 6 

The most important reason for the difficulty at the border line 
between consonants and vowels is that many criteria, and criteria of 
conflicting status, are used. When these various criteria are strictly 
applied, each of them draws the boundary at a different place; the 
groups determined by the several criteria do not coincide. In this 
predicament it is left to the “ predominating ” feature or features to 
decide which criterion shall classify particular sounds . 7 Such an 

1 See Bloch and Trager, Tables for a System of Phonetic Description (Pre- 
liminary Ed.), 2. 

2 Cf. Fletcher, Speech and Hearing, 7. 

3 See, for example, Forchhammer, How to Learn Danish' 1 -, Ward, An 
Introduction to the Ibo Language. See also The Principles of the International 
Phonetic Association, Supplement to Le Maitre Phonetique. 

4 Jones, An Outline of English Phonetics 4 , § 96. 

5 See Jones, 46, n. 5, §§ 183, 227, 231, 793, 799. 

6 See Noel-Armfield, General Phonetics 4 , 9 n. ; Westermann and Ward, 
Practical Phonetics for Students of African Languages, 20 n. 

7 See Kenyon, American Pronunciation 6 , 57; Passy, The Sounds of the 
French Language 2 , 4-3. 


66 



CLASSIFICATION CRITERIA 67 

expedient disguises the real problem without solving it, since the de- 
cision as to which factor is predominating is arrived at by still other 
criteria, and, to make the matter worse, these often are not set forth 
clearly, if they are stated at all. Articulatory, acoustic, and con- 
textual criteria are three of the basic types used to determine whether 
sounds are consonants or vowels; they will be treated below in this 
order. 

Obstruction of the air stream at times constitutes one articulatory 
criterion for consonants. 8 This by itself is insufficient. With stops, 
the obstruction is complete, and no question arises about their con- 
sonantal nature, but continuants cause a problem, since some of them 
are classed as fricatives (e.g. [s] ), while others are regarded as vowels, 
even though both types have partial obstructions. The criterion which 
determines whether a partial obstruction constitutes a consonant- 
causing obstruction (rather than a vocalic lack of obstruction) is 
not wholly articulatory. 9 What does constitute obstruction of the 
kind to make consonants? If contact at the sides of the tongue, or 
small height of opening (so that a small hole is left for air to escape), 
is such, then [i] would be a consonant, since Kenyon notes that it may 
have more tongue contact than fl] , 10 There seems to be no articula- 
tory measuring rod for degree of constriction or obstruction which 
marks the consonant-vowel border. Even a velic closure in oral vow- 
els keeps the air stream from escaping by the nose, and so a pure 
obstruction criterion which ignored rank of stricture (see above, pp. 
61-62) would label oral vowels as consonants. 

Narrowing 11 as a criterion fares precisely as does contact; 12 no 
articulatory measure for degree of contact or narrowing is provided to 
show the point at which a vowel becomes a consonant. If one starts 

8 See Bloomfield, Language, 102; Jones, §§97, 99; Nicholson, A Practical 
Introduction to French Phonetics, 42; Noel-Armfield, 9; Ripman, Elements of 
Phonetics, 10, 12 ; Soames, Introduction to English, French and German Pho- 
netics 3 , 33; Stirling, The Pronunciation of Spanish, 6; Sweet, A Primer of 
Phonetics 3 , 22, 31 ; Vietor, German Pronunciation 5 , 36 ; Westermann and Ward, 
19-20. 

9 See acoustic types (pp. 70-72) for criterion of audible friction. 

10 Kenyon, 60. 

11 See Jones, §97; Kenyon, S7-S9; Nicholson, 42; Ripman, 12; Soames, 
33; Sweet, 31; Vietor, 36. See also Jespersen (The Articulations of Speech 
Sounds Represented by Means of Analphabetic Symbols ) for degrees of opening. 

12 See Bloomfield, 102 ; Kenyon, 37-39. 



68 


PHONETICS 


with a little narrowing, caused in turn by slight contact, and gradually 
increases contact to produce further narrowing, oral sounds will gen- 
erally have begun as vowels and ended as consonants, with no articu- 
latory marking 13 of the transition. 

One clear articulatory essential for a vowel is that at least part 
of the air must be coming out of the mouth, although part may also 
be escaping through the nose . 14 This eliminates stops and nasals from 
vocalic classification, but it does not separate oral consonants from 
oral vowels. 

Jones and Kenyon use movement as a criterion for certain con- 
sonants (glides, [j], [w]) in contrast to vowels, which are said to 
have fixed, or relatively fixed, position; 15 but Jones also uses move- 
ment as a criterion for certain types of vowels (diphthongs 16 ). The 
criterion is vitiated when he shows variants of both the glides and the 
diphthongs, written with the same letters (or the same letters plus 
a length sign), which are not quick glides but have at least part of 
the sound relatively fixed . 17 He states that the two are difficult to 
distinguish in certain combinations 18 and fails to set forth the method 
for determining which is which . 19 Further, for the less prominent 
part of a diphthong he uses the term “ consonantal vowel,” which 
he states is not quite accurate, without giving reasons , 20 though per- 
haps he objects to suggesting that a vowel has consonant character- 
istics. To say the least, these difficulties demonstrate that the line 
of demarcation between vowel and consonant is not too readily 
drawn , 21 and that Jones’s criteria are not too easily applied. One 
might reason that no positions are fixed, but all are fluid, and there- 
fore that no movement criterion is valid; the movement, however, 
might in certain instances be below the perceptual level or near it, and 


13 Cf. pp. 138-39. 

14 See Bloomfield, 102; Jones, §§97, 99; Passy, 56; Westermann and 
Ward, 19-20. 

15 Jones, §§ 102, 800, 813; Kenyon, 59, 152, 155, 233. 

16 Jones, § 219. Presumably Jones considers diphthongs vowels, if every 
sound is either consonant or vowel (§ 96). 

17 Jones, §§ 227, 799. 

18 Jones, 58, n. n, § 226. 

19 Presumably the primary criterion is contextual function, to be discussed 
below, pp. 73-77. 

20 Jones, § 231. 

21 As is well known. Cf. Soames, 33. 
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hence give rise to contrast between perceptual movement and percep- 
tual lack of movement. One therefore could make no objection on 
this particular basis to movement as a criterion. 

Certain other factors (speed, force, voicing) with articulatory re- 
lationships are more conveniently discussed under contextual criteria. 
These will be given attention below (see pp. 73-77). 

Two acoustic criteria of importance are used in separating vowels 
and consonants. The vowels are considered to be the sounds which 
are naturally more sonorous and resonant than the consonants. 22 At 
this point the sonority with which we are concerned is that which 
might be applied as a criterion to sounds in isolation. Sonority which 
is studied as relatively greater for one sound than for another in a 
particular context is a separate matter (see below, pp. 70-71). The 
sonority that classes a sound as a vowel or consonant regardless of its 
surroundings is an acoustic (not a contextual) criterion. 

Strangely enough, phoneticians seem usually to establish their 
consonant-vowel groupings by some such criterion in conjunction with 
contextual types, and then attempt to define or defend such divisions 
by mechanical, articulatory technics, ignoring the very factors which 
originally caused the postulation of the division. Jones (§97), for 
example, first defines a vowel as being “ a voiced sound in forming 
which the air issues in a continuous stream through the pharynx and 
mouth, there being no obstruction and no narrowing such as would 
cause friction.” Then later he elucidates his reason for choosing such 
a delineation: “It so happens that the sounds defined as vowels 
in § 97 are noticeably more sonorous than any other speech-sounds 
(when pronounced in a normal manner) ; and that is the reason why 
these sounds are considered to form one of the two fundamental 
classes.” 23 

This statement Jones modifies by an important footnote in which 
he admits that cardinal [i] is apparently less sonorous than some other 
speech sounds. Kenyon also shows that certain sonants (e.g. [ 1 ] 
sounds) are very resonant, almost enough so to come under definitions 
of vocalic sonority; 24 he is forced to use other criteria (tone quality 
and front contact) to prevent that classification. Passy likewise sees 

22 See Kenyon, 57 ; Westermann and Ward, 89, no. 

23 Jones, § 100. 

24 Kenyon, 60. 
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that strict application of such criteria would tend to the classification 
of sonants as vowels, but exclaims that linguistic instinct protests 
against the idea. 25 Such factors demonstrate conclusively that nat- 
ural sonority is not at all the complete reason for basic vowel-con- 
sonant classification; linguistic instinct is formed primarily by con- 
textual function. Nevertheless, natural sonority seems to have a very 
large influence on consonant-vowel delineation as one important fac- 
tor determining contextual function itself. Nonsyllabic vowels of 
[a] or [e] type and the like 26 (in diphthongs) give one of the strong- 
est evidences of natural sonority as a criterion for their vocalic classifi- 
cation, since at that point contextual prominence is not so marked. 

The second main acoustic criterion is friction, and is used to some 
extent by practically all writers on the subject. 27 Sounds wherein 
some obstruction causes audible friction are thereby considered to be 
consonants. Friction in this way proves to be a test of the degree of 
obstruction for vowel and consonant differentiation. Of itself, an 
articulatory technic cannot establish this boundary since, as we have 
already shown (p. 67), obstruction passes from vowel type to con- 
sonant type by imperceptible degrees, while differences of strictural 
function also affect the result. At this point auditory acoustic technic 
supplements the articulatory one by providing a helpful criterion. 

Actually, however, the test is not consistently applied. For sounds 
such as [f] and [s] there is no doubt; they are obviously consonants. 
But certain sounds classified as vowels (close types) may actually 
contain audible friction, 28 while, on the other hand, many sounds 
without audible friction (voiced sonants) 29 are nevertheless consid- 
ered to be consonants. Various writers mention this difficulty. 30 

Even with friction and sonority as criteria, the border line between 
consonants and vowels is uncertain, hazy, and wavering. Friction, 

25 Passy, Petite phonetique comparte 3 , 14. 

26 ’ See Jones, §§ 225, 423, 466. 

27 See Bloomfield, 102; Jones, 31, n. 5-6, §§97, 99, 811; Nicholson, 42; 
Noel-Armfield, 8, 9 ; Passy, Phonetique, 108 ; Sounds, 67, 86 (and noise, 68) ; 
Soames, 42 ; Sweet, 13, 22 ; Westermann and Ward, 19. 

28 See Jones and Woo, A Cantonese Phonetic Reader, xii; Karlgren, A Man- 
darin Phonetic Reader ( Archives d’Etudes Orientates, Vol. 13), 6-7; Noel- 
Armfield, 9n.; Passy, Sounds, 15 ; Sweet, 22. 

29 See Jones, 24 n., 183, 749, 796; Kenyon, 48. But see Noel-Armfield, 9 n., 
and Passy, Sounds, 70, 73, who hear slight friction in such sounds. 

30 See Noel-Armfield, 90.; Westermann and Ward, 20 n. 
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designed to measure the amount of obstruction, in turn needs some- 
thing to measure its degree of noise, since it shades from no per- 
ceptible noise to strong amounts, 31 from zero audible friction in voiced 
open vowels and sonants to slight friction in certain close vowels and 
sonants and to considerable amounts in fricatives. The measuring rod 
actually used generally proves to be contextual function. 

One of the marked weaknesses in current usage of friction as a 
criterion for consonant-vowel differentiation lies in the failure to 
distinguish satisfactorily between two types of friction which func- 
tion very differently and have different origins, even though the border 
lines are not sharp between them. Generally speaking, one type re- 
tains its audibility when voiced (e.g. for sibilants) ; the other is a 
weak fricative and audible only when sounds are voiceless 32 (e.g. 
most vowels, certain sonants). The first type results from stricture 
at a single local point; the second is due to cavity friction, that is, 
voiceless resonance of a chamber as a whole 33 caused by air going 
through it as through an open tube. Friction of either type may 
occur in cavities of different rank and function. To lump together 
as fricatives all sounds derived in these two ways, or to classify all 
of them immediately as consonants, because of friction, regardless of 
the type of friction or the rank of the cavity in which it occurs, in- 
vites difficulty and inconsistency in the separation of vowels and con- 
sonants. 

By working with these two degrees of friction and combining 
them at the same time with a classification of strictural function which 
subordinates pharyngeal and glottal activity to oral types (see pp. 
129-33), one can arrive at a significant statement concerning the 
phonetic patterning of [ h] , in relation to voiceless vowels, and whis- 
pers, and “voiced [h].” 34 Both voiceless and voiced vowels have 
cavity friction; as with other types of cavity friction, the first tend 
to be audible and the second inaudible. If to each type one adds local 

31 Cf. Bloomfield, 97. 

32 See Jones, § 820, for one illustration of local and cavity friction (voice- 
less fricative [j] and voiceless semivowel lj]), which are hard to tell apart; the 
high tongue position for the cavity almost localizes the friction, causing the 
difficulty. 

33 See Kenyon, 57. 

34 Cf. Jespersen, Lehrbuch der Phonetik 4 , 91-93; Jones, § 779; Kenyon, in 
Webster, § 44 (10) ; Noel-Armfield, 109-12 ; Sweet, 55 ; Westermann and Ward, 86. 
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glottal friction, the first gives whispered vowels 35 and the second 
vocalic timbres of “ voiced [h] creating the proportion, voiceless 
vowel (i.e. [h] ): whispered vowel :: voiced vowel voiced [h].” 
The letters [h] and [fi] would simply be convenient symbols repre- 
senting any vocalic mouth position with the requisite inner modifica- 
tions. That [h] may be used to function in the system of a particular 
language in a way similar, for example, to [p], [n], or [s] (rather 
than [a] or [u] ) is a factor of contextual function which should not 
affect the description of the sound in a classification built upon char- 
acteristics of the method of production. Contextual influence upon 
the description of [h] will be pointed out a bit later (pp. 76-77). 

Presumably the criterion of sonority is partly responsible for the 
entrance of voicing into vowel definitions. 36 Conversely, the criterion 
of friction has been partly responsible for the elimination of voiceless 
or whispered vowels 37 from those definitions (contextual function ap- 
pears to have been the cause of their retention as “ vowels ” of ab- 
normal type) . 

In languages the nasals seem to have no voiced type with local 
(velic) friction, but only a type with cavity friction. The same is 
true of the voiceless type ; hence the friction is very soft. 38 Laterals 
usually, but not always, have inaudible cavity friction when voiced, 
but upon becoming voiceless narrow the opening sufficiently to get 
local friction 39 (so producing the difference in strength of friction 
between voiceless [n] and [1] mentioned by Bloomfield 40 ). Voiced 
frictionless sonants may be said to be similar to vowels in resonance, 
but when the sonants are voiceless their cavity (or local) friction 
seems to have prevented this comparison; 41 actually it should not do 
so, since the vowels when voiceless also have cavity friction. 

If both articulatory and acoustic criteria are to be used to dis- 
tinguish consonants from vowels, a descriptive order is needed for 

35 For strictural function in the closure of the vocal cords and opening of 
the whisper glottis see p. 136. 

36 See p. s, n. 1. 

37 Cf. Jones, §§ 776-77; Kenyon, 38; Noel-Armfield, 8n.; Westermann and 
Ward, 85; Ripman, 21, 59. 

38 Cf. Bloomfield, 96; Noel-Armfield, 9, n. 1; Sweet, 31. 

39 Cf. Bloomfield, 102. 

40 Bloomfield, 96-97. 

41 See Jones, 24, n. 3; Kenyon, 60; Noel-Armfield, 9, n. 1; Sweet, 31; cf. 
Bloomfield, 102. 
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their application. A certain criterion should always be applied first, 
another second, and so on, regardless of where the procedure leads; 
it cannot give exactly the same results as are now obtained. There is 
no objection to supplementing articulatory criteria with auditory 
acoustic aids if they can be controlled; in fact, we have suggested 
above (pp. 71-72) an extension of types of friction criteria. How- 
ever, the application of these criteria must be in strict order, or over- 
lapping groups will be obtained; consistent results necessitate con- 
sistent procedure. 

Contextual function is the third major type of criterion which 
phoneticians have used for distinguishing vowels from consonants. 
Where one sound is affected in any way in its classification because 
of its relationship to neighboring sounds or to a system, contextual 
function has caused the change, instead of its position having been 
determined by its own absolute articulatory or acoustic nature. Con- 
textual function of segments is of two kinds: (1) function in a larger 
phonetic unit, the syllable, or (2) function in a linguistic system as 
a phoneme with pattern relationships. 

Stetson states that the decision whether a sound is a consonant 
or a vowel depends upon “ what it does in the syllable.” 42 If he 
were consistent in the application of this criterion, every syllabic 
would be a vowel, and every nonsyllabic a consonant. This shows 
contextual function of the type which is phonetic because of the re- 
lationship of the sound to a phonetic unit, the syllable. 

Jones uses contextual function in vowel-consonant classification 
in many different ways. One of these has to do with relative force. 
Sounds which would be vowels in isolation he classes in one place as 
consonants because they are weaker than certain others in the im- 
mediate context: “ The palatal and velar frictionless continuants 
have the organic positions of close vowels. In fact they are vowels, 
but they are uttered with very little breath force as compared with 
the normally pronounced vowels which adjoin them in connected 
speech. These frictionless continuants are to be considered as con- 
sonants on account of their consequent lack of prominence as com- 
pared with the adjoining vowels.” 43 

The same relative (contextual) criterion of force is used in con- 

42 Stetson, “ Motor Phonetics . . . ,” Arch. Neer. Phon. Exper., 3 (1928), 20. 

43 Jones, 46, n. 5. 
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junction with other criteria when Jones states that glides (or even 
sounds in the same phoneme but not gliding 44 ) are consonants rather 
than vowels; the decisive factor is their “ lack of stress as compared 
with the succeeding vowel.” 45 

A similar situation is seen in regard to quantity. The shortness 
or rapidity of a sound relative to others is used by Jones and Kenyon 
as one criterion for the consonantal nature of glides. 46 

Voegelin mentions that Shawnee exhibits the usual vowel-conso- 
nant distinction “ on distributional as well as on phonetic grounds.” 47 
That distribution feature which Voegelin has here pointed out as a 
phonemicist is probably the most influential of all criteria for actually 
determining the border lines at which particular phoneticians separate 
consonants from vowels. Although many articulatory or acoustic 
criteria may be mentioned by an author, somehow the resultant group- 
ings always reflect the phonemic division as seen in permitted group- 
ings of phonemes, sequences of syllabics, and the like. This is the 
reason that the articulatory and acoustic criteria already discussed 
have not been consistently applied, since they were not the deciding 
factors; authors have simply attempted to give an articulatory ra- 
tionalization of the consonant-vowel division after it had been reached 
(consciously or unconsciously) on contextual grounds. 

Phoneticians of course want practical results. Yet “ practical ” 
divisions, like “ linguistic instinct,” tend to reflect linguistic usage, 
which in turn inevitably means contextual function. Kenyon demon- 
strates the relationship between the “ practical ” criterion and the 
distributional one: “ Vowels and consonants have many features in 
common. It is the predominating features that are significant for 
each class by itself. These predominating features determine the 
practical use of the vowel and consonant sounds in actual speech.” 48 
“ Walker (1791) pointed out that the sounds [w] and [j] were treated 
popularly as consonants in actual unconscious speech by the use of 
the indefinite article [a] ( [a wok, a jok] ) . Likewise today the definite 
article [ 3 a] and the preposition [ta] are used ( [8a wak, 8a jok, ta 
W 33 -, ta jusrap] ) as before other words beginning with consonants. 

44 Jones, § 799. 

45 Jones, §§ 102, 800. 

46 Jones, §§ 102, 800; Kenyon, 155. 

47 Voegelin, “Shawnee Phonemes,” Lang., 11 (1935)1 2 Z~Z 1 - 

48 Kenyon, 57; cf. p. 233. 
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But [an], [ 3 i], and [tu] are used before [u], [i], and [3] ([an 
uzig sprit), 81 ist, tu 583], etc.).” 49 

Instead of the predominating features determining use in speech, 
use in speech has determined Kenyon’s choice of “ predominating 
features.” In appealing to popular use he appeals to his most basic 
(but assumed) criterion, contextual function. The sandhi of addi- 
tion of [n] in the indefinite article before vowels merely shows that 
[w] and [j] are in the same distributional group with [p], [s], and 
so on ; it proves nothing as to the innate articulatory movements pro- 
ducing the sound, or as to the acoustic result. When, on the one hand, 
Kenyon’s acoustic and articulatory criteria are not consistently ap- 
plied, but, on the other, his divisions correspond fairly closely to pho- 
nemic ones, the conclusion is inescapable that his basic criteria have 
been phonemic assumptions. Presumably some such factor explains 
his surprising conclusion that [w], [j], and [r] when in certain pre- 
vocalic positions are consonants, but when final in [ai], [aw], and 
[aa ] are “ nonsyllabic vowels, both in function and organic nature.” 50 
Change in phonemic usage should not change phonetic description if 
the absolute nature of the sound is considered to be unchanged. 

If a syllabic is established acoustically by relative loudness 51 or 
prominence 52 it cannot be any innate natural sonority which always 
forces certain sounds to be syllabic, or else there could never be any 
variation between a sound which, in the same phonetic context, is 
sometimes syllabic and sometimes not, like English sonants [ 1 ], [r], 
[m] , and [n] , or like nonsyllabic vowels. The syllabic nature of any 
particular utterance of a sound is produced by a type of syllabic func- 
tion which in turn is a type of contextual function even though this 
function is conditioned by articulatory movements. 

When Jones says that “ It is as a consequence of this principle 
of relative prominence that certain short vowel-glides must be re- 
garded as consonants,” 53 we are immediately aware that he is using 
contextual function as his criterion. Since we have already mentioned 
his use of two other contextual factors (relative force and relative 
duration) but of only one articulatory one (movement — and this is 

49 Kenyon, 60. 

50 Kenyon, 234. 

51 See Bloomfield, 125. 

52 See Jones, §§209, 211, 215, 218; Noel-Armfield, 51. 

53 Jones, § 102. 
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not consistently applied) it becomes obvious that for Jones, as well 
as for Kenyon, glides are consonants, not because of their basic articu- 
latory or acoustic structure, but simply because of function in a 
phonetic or phonemic context. 

Syllabic contextual function is reflected in phonetic alphabets. 
Sounds which are described by the same procedure but which are 
used differently in phonemic systems as syllabics in contrast to non- 
syllabics are given different symbols, 54 and at times are given names 
such as “ semivowel ” and the like. This is occasionally an evidence 
of indecision on the part of the authors as to whether certain sounds 
should be considered consonants or vowels (or some type halfway 
between, in spite of a desired sharp division between vowels and con- 
sonants 55 ) when contextual criteria point in one direction but acoustic 
and articulatory features of the segments point in another. 

In addition to the factors of sonority, friction (see p. 72), and 
speech normality (see pp. 5-6), which have determined the exclu- 
sion of voiceless vowels from vowel definition, contextual factors have 
contributed to that result. In spite of the fact that Jones allows only 
for voiced types in his vowel definition, he admits in various places 
that [h] sounds “ are simply vowels pronounced with breath instead 
of voice ” 66 ; the [h] sounds could in “ a very narrow transcription ” 
be written with voiceless vowel symbols, but such a mode of representa- 
tion would be “ both inconvenient and unnecessary.” 57 The reason 
it is unnecessary is that the different voiceless vowels constitute a 
single phoneme in English, which would not be at all true if thev 
were in phonemic contrast. 

The distributional criterion is seen in Kenyon, where [h] is said 
to occur only before “ sounds with unobstructed outflow of breath ” 58 ; 
this is an item of English phonemic distribution having no essential 
relationship to [h] types as acoustic phenomena since they can also 
be pronounced in isolation or in other phonetic contexts. Kenyon 
objects to considering any [h] a voiceless vowel; 59 the phonemicist 
who must study languages yet unreduced to writing knows, however, 

54 See Bloch and Trager, 2. 

55 See Noel-Armfield, 9, n. J. 

56 Jones, §§ 782, 777 . 783. P- 186, n. 43. 

57 Jones, 186, n. 43; § 777. 

58 Kenyon, 138. 

59 Kenyon, in Webster, §44 (10). 
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that he cannot tell whether a certain sound is an “ h ” or a “ voiceless 
vowel ” until he can classify its distributions and contrasts to see how 
it patterns with other consonants or vowels. 

Jespersen states that “ to be conceived as an f h] the voiceless vowel 
must be pronounced immediately before or behind a voiced vowel.” 60 
This represents a still different setup for distributional determination 
of consonant and vowel. 

The question needs to be raised as to what are the aims of phonetic 
description. Are they to show the acoustic or articulatory character- 
istics of sounds, or are they rather to show interrelationships of the 
sounds of systems? Regardless of which the phonetician feels is his 
task, or whether he undertakes both, the two aims must be kept strictly 
apart. The phonemicist, dealing with the second, has come to see that 
from the point of view of the description of the functional structure 
of a system the particular phonetic value of a sound is not always 
especially pertinent to its place in the system ; 61 the essential element 
is a group of acoustic symbols in certain interrelationships. 

The phonetician has not yet realized that for describing the nature 
of a sound its phonemic relationship to other sounds and its relative 
value, rather than its absolute type, are in turn not pertinent to his 
description of the sounds themselves. In fact, to the very extent 
that the phonetician makes his classification dependent upon contextu- 
ally relative features, he has made himself incapable of describing the 
sounds as units themselves. If the single sounds of a continuum can- 
not be described without reference to each other, then the continuum 
can be described only as a whole, not in parts; but if its parts cannot 
be described first, how can it be described as a whole? 

A phonetic system should be able, within the limits of the accuracy 
and finesse of its articulatory, acoustic, or imitation-label procedures, 
to describe any sound in isolation, or in nonsense syllables, or as cut 
from a continuum of speech, without the necessity of referring to 
other sounds in the context to find criteria for its classification. A 
phonetic science should be able to define and describe its own units 
by its own data; if it cannot, how can it later describe for the pho- 
nemicist the speech system which the latter has presented? If the 

60 Jespersen, Articulations, 71. 

61 Cf. Bloomfield, “ Menomini Morphophonemics,” Htudes phonologiques 
dediees a la memoire de M. le prince N. S. Trubetzkoy ( Travaux du Cercle 
linguistique de Prague, Vol. 8), 107, n. 1. 
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phonetician first delimits supposed articulatory classes by phonemic 
features, how can he then describe the phonemes with articulatory 
methods? Any such attempt presents a vicious circle of phonemics 
to phonetics to phonemics, with the phonetician starting at phone- 
mics. A sound is the same sound wherever it occurs — but for the 
phonetician who uses essentially variable phonemic criteria to describe 
that sound, it may become a chameleon and change its color in dif- 
ferent languages or even at different points in the same language. 

In this study I attempt to draw up a phonetic scheme in which 
any sound can be described without reference to contextual function 
of the phonemic kind (i.e. sounds in speech) . The invention of a con- 
siderable amount of new terminology has been essential, in spite of 
the fact that many current terms have been retained after redefinition. 

Vocoid and contoid groups are strictly delineated by the articu- 
latory and acoustic nature of sounds, without reference to phonemic 
contextual function. The terms approximate current groupings of 
vowel and consonant in some of their major phonetic characteristics. 

Vowels and consonants are then categories of sounds, not as de- 
termined by their own phonetic nature, but according to their group- 
ing in specific syllable contextual functions. This definition is flexible, 
awaiting elaboration or phonemic modification for each particular lan- 
guage after phonemic contextual data for the language have been as- 
sembled. 

I have found just one work (Bloomfield) which makes the divi- 
sion between a general phonetic articulatory definition of vowel and 
consonant, and a phonemic definition for particular languages; un- 
fortunately, this excellent lead has not been followed by subsequent 
phonetic writers. Bloomfield first gives an articulatory definition of 
the vowel , 62 then later states that it is convenient to use this and other 
terms in different ways and to supplement them for description of in- 
dividual languages . 63 He proceeds to demonstrate convincingly the 
way the phonemicist can define his flexible terms to fit a particular 
language; his sample is English . 64 

No other phonetic dichotomy entails so many difficulties as con- 
sonant-vowel division; articulatory and acoustic criteria are there 
so thoroughly entwined with contextual and strictural function and 
problems of segmentation that only a rigid descriptive order will sep- 

63 Ibid., 102 n. 64 Ibid., 130. 


62 Bloomfield, Lang., 102. 
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arate them. Once granted this order, however, further criteria fit 
the pattern quite readily for the remainder of phonetic classifications ; 
lacking it, difficulties initiating at this point carry clear through a 
system . 65 

Criteria for various series of sound types need no detailed dis- 
cussion at this point; most of them have been given directly or by 
implication. Nasals as distinguished from stops and vowels have 
already been mentioned (with cavity function separating nasals from 
vowels) ; fricative or frictionless laterals are characterized by front 
contact and side opening, in contrast to the side contact and front 
opening of central fricative and frictionless continuants; oral frica- 
tives are separated from vowels by an acoustic criterion; affricates 
and diphthongs break up phonetically into individually described seg- 
ments. 

One further item merits some mention. Some articulators, espe- 
cially the tip of the tongue, can touch at more than one point, for 
example, the teeth, or the dome of the mouth. For many sounds 
it proves more convenient to assume a norm in which an articulator 
touches a certain spot (e.g. back of tongue on velum). Sweet com- 
ments that an alphabet could use separate signs for these factors , 06 
although in general he does not attempt such detail; Jespersen criti- 
cizes Bell’s visible speech for not doing so, 6T and then provides for 
them in his own analphabetic system. Choice of such representation, 
along with the number of points of articulation to be shown in basic 
symbols, seems to be a question of practical rather than theoretical 
limits . 68 If such a term as “ labiodental ” is to be used to show an 
articulator in relation to its point of articulation, then, obviously 
enough, “ labiovelar ” should not appear in the same series with it. 

65 Cf. Jespersen, Articulations, 5, for criticism of Bell’s system of symbols. 

66 Sweet, 39. 

67 Jespersen, Articulations, 5. 

68 See Bloch and Trager, 2. 




PART II 


PHONETIC THEORY: A CONSTRUCTIVE 
SYSTEM 



INTRODUCTION 


I N PART I the difficulties were discussed which remain in phonetic 
theory because of a legacy from prephonemic days when phonetics 
and phonemics were one. Sounds were shown to have been chosen for 
description because of their use in speech rather than because of their 
articulatory or acoustic nature; many sounds were seen to have been 
ignored because they were not produced in a certain way; classifica- 
tions had numerous basic assumptions behind them which were not 
apparent since the contrastive data were not given due attention. 

In Part II of this study a phonetic classification is presented which 
should allow, with slight modifications or minor additional categories, 
for the analysis of all sounds — not just a group determined by non- 
articulatory means. This gives an opportunity to emphasize many 
factors which are important to phonetic theory and to contrast them 
with other items within a unified system. The defining of assumptions 
essential to such theory has made it possible to eliminate certain other 
assumptions related to phonemics that are undesirable here. 

The phonetic materials which have served as a background for 
this study were collected by me during six years of investigation of 
living languages with informants in the field. During that time an 
endeavor was made to analyze and record the sounds of the following 
languages: Amuzgo, Aztec, Cakchiquel, Cherokee, Chinanteco, Dela- 
ware, Huaxteco, Lithuanian, Maya, Mazateco, Mixe, Mixteco, Nav- 
ajo, Ojibwa, Otomi, San Bias, Spanish, Tamil, Tarascan, Tlapaneco, 
Totonaco, Trique, Tzeltal, and Zapateco; all but three of these (Lithu- 
anian, Spanish, Tamil) are Indian languages of North America. 

Chapter VI attempts to relate the production of the vast majority 
of vocal sounds to a simple set of mechanical processes which initiate, 
compress, rarefy, combine, or reinforce air streams. Both speech 
sounds and nonspeech sounds use these processes. 

Chapter VII describes the methods by which the air streams are 
controlled and impeded so as to result in different kinds of sounds 
within any one major type of mechanism. The air streams passing 
through open cavities with as little obstruction as possible produce 
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only weak sounds; when passageways are partly closed, however, 
strong sound vibrations may be set up. A discussion of the different 
moving parts which can cause such interference, the shapes they may 
assume in the process, the places at which the interference takes place, 
the manner in which these moving parts form the obstruction, and the 
resultant differences in types of sound, constitutes a considerable part 
of this chapter. The first section delineates the natural segmentation 
which was postulated in Chapter III and which proves to be condi- 
tioned by these obstructing movements. Later in the chapter the 
differences of strictural function which were noted in Chapter IV are 
given a working classification. 

Chapter VIII, which concludes the study, has two main features: 
( i ) A resume is offered of sound types which do not seem to occur 
in speech. Although certain types of sound-producing mechanisms 
are apparently never used in phonemic systems, others may be. This 
forms the basis for a statement of a line of cleavage between non- 
speech sounds and speech sounds. (2) A summary is presented of 
certain features which may be described profitably for any sound to 
classify it by its productive mechanism and, roughly, by its controlling 
mechanisms. A symbol is provided for each of these factors, resulting 
in an analphabetic system of phonetic notation whose chief value lies 
in calling one’s attention to the great number of assumptions which 
underlie our normal (and much more useful) phonetic symbols, the 
letters of the alphabet. 



CHAPTER VI 


PRODUCTIVE MECHANISMS 

TINT CHAPTER I the need was indicated for a theory that would 
show basic relationships between all types of sound mechanisms 
which involve compression and rarefaction. The present chapter is 
designed to meet that need by describing the air chambers and move- 
ments involved in various compression-rarefaction systems and to 
contrast these with other productive sound mechanisms. 

Sounds are vibrations with characteristics of frequency, intensity, 
and duration which produce certain sensations of audibility when im- 
pinging upon the ear. Sound waves may be set up in the physical 
world in a variety of ways. A few of these methods concern us here: 
One object striking against another, as when a hammer hits an anvil, 
sets up vibrations. One object scraping or rubbing against a second 
may likewise cause sound waves, as when a pen scratches on paper. A 
blast of air may be set in vibration by going over a sharp ledge or 
through a narrow orifice, as when air escapes from an automobile tire. 
The air may put an object in vibration, as does the breath of a clarinet 
player the reed of his instrument. A body of air may vibrate, as does 
a resonance chamber below a tuning fork. Vibrations may travel from 
one medium to another ; the dropping of a book in one room may be 
heard in another as sound waves travel through the partition and in- 
duce vibrations in the adjoining room. All these methods can be dis- 
covered in the production of vocal sound, although a large majority 
of the sounds utilize a moving column of air. 

Air-Stream Mechanisms 

Within the vocal apparatus there are five cavities (see Fig. i). 
These embrace the oral, nasal, pharyngeal, pulmonic, and esophageal 
(including the stomach) cavities. A back velar (or uvular) closure is 
considered to be within, but terminating, the oral cavity; analogously, 
a velic closure (“ velic ” used, as in Part I, to indicate the upper part 
of the velum facing the nasopharynx) lies within the nasal cavity; 
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glottal and esophageal closures are contained within the pharyngeal 
cavity. It does not seem expedient to attempt to work with a classifi- 
cation which would separate the laryngeal and pharyngeal cavities, or 
the esophageal and stomach cavities. 

Any cavity or part of a cavity which is completely shut off by itself 
from others by some closure, or any group of cavities or parts of 



Fig. i. Cavities Fig. 2. Chambers 

E, esophagus; L, lungs; M, mouth; A, parts of the active chamber; P, 

N, nose; P, pharynx passive chambers. The sound is 

[tl 

cavities which are united by connecting passageways of air, forms 
an air chamber. Thus, during the stop [p] there are three air cham- 
bers: the nasal, the oral-pharyngeal-pulmonic, the esophageal. With 
a nasalized vowel [a], only two air chambers are present: the eso- 
phageal and the oral-nasal-pharyngeal-pulmonic. With [n] there 
are three: esophageal, front oral, and (back) oral-nasal-pharyngeal- 
pulmonic. 

The air chamber in specific use for a particular sound, the one 
which contains the air stream, is the active chamber (see Figs. 2-3, 
9). All other chambers are passive (see Figs. 2-3). In the examples 
given in the preceding paragraph the esophageal chamber was always 
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passive. The nasal chamber is passive for all oral sounds; the oral 
and nasal chambers are passive for glottal stop. During [p] the oral- 
pharyngeal-pulmonic chamber is active, and so on. 

When one is describing sequences of segments, another term is con- 
venient too. A passive chamber is also semipassive (see Fig. 7) when 
due to become active during the next sound. With [k] the oral cham- 



Fig. 3. Side chamber 

A, active chamber; P, passive 
chamber; S, side chamber. The 
sound is [n] 



Fig. 4. Capped air stream 
C, air stream; I, initiator. The 
sound is [t] 


ber (the part in front of the closure) is passive, but in [k] followed by 
[w] or [a] it is at the same time semipassive, since after the velar 
release air will be passing through the oral cavity and making it active. 

When part of an active air chamber does not have the air stream 
passing directly through it, but only by it, that part becomes a side 
chamber (see Fig. 3). This is seen almost exclusively during the 
production of front nasals such as [m] or [n] . The air comes from 
the lungs out through the nose, but does not pass directly through the 
mouth even though the mouth has direct connection with the pulmonic, 
pharyngeal, and nasal cavities and affects the total resonance of the 
sound. 

Certain parts of the walls of an air chamber may be flexible: any 
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part is an initiator (see Figs. 4-10) of an air stream at the time the 
part is moving within the chamber, making the chamber smaller or 
larger. The bellows of an accordion constitute an initiator ; the total 
air chamber is made larger and smaller by its movement, and this 
movement initiates the air stream which vibrates the reeds. The 
lungs are the most frequently used initiator in the production of vocal 
sound. Other initiators are the moving larynx, tongue, lips, esophagus, 
and so on. They will be discussed separately in a moment. (All 
English phonemic norms are produced while the lungs function as 
an initiator; various clicks, and similar sounds, of African languages 
entail some other initiator.) 

With but few exceptions, the initiator is on the inward side of its 
air chamber: the lungs are inward in respect to the oral and nasal 
cavities ; the velar closure and movement of the back of the tongue are 
inward in respect to the oral cavity employed in clicks; and so on. In 
a reversed mechanism the initiator is outwardly situated. An example 
of this is the sound caused by suction when the tip of the tongue moves 
upward against the lower lip (see Fig. 9). 

When an initiator moves toward the center of its chamber it makes 
the chamber smaller (contracted) and compresses the air contained 
therein. If there is an opening in the chamber at the mouth or nose 
an air stream is forced outward. All sounds made in this manner are 
egressives (or compressives) (see Figs. 4, 10); [m], [ 1 ], [s] are 
samples. If the initiator moves away from the center of its air cham- 
ber it makes the chamber larger (expanded) and produces a partial 
vacuum. If an opening exists in the chamber at the mouth or nose 
an air stream is drawn inward. Sounds made thus are ingressives (or 
raref actives) (cf. Figs. 5-8); ingressive lung-air sounds, and clicks, 
are produced in this way. 

If no opening at the mouth or the nose is provided for compressives 
(e.g. stops fp], [t], [k] ) , an air stream is initiated which continues 
only until the pressure is equalized throughout the entire chamber. 
Pressure of the lungs in [b], for example, may force enough air into 
the mouth through the glottis to vibrate the vocal cords for a moment, 
but as soon as the pressure in the mouth equals that of the lungs, no 
more air will pass the vocal cords (and hence vibration will cease) 
unless the cheeks are expanded to provide further place for entrance 
of air or unless the velic or labial closure is released (which, of course, 
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changes the stop into a continuant). In such a situation, either be- 
fore or after the equalization of pressure, the air stream is capped 
(see Fig. 4). With rarefactive stops the situation is in reverse. The 
air stream persists in the direction of the lowest pressure area until 
the pressures are equalized. 

An initiator with its full gamut of possible air chambers is an air- 
stream mechanism, or, more briefly, an air mechanism (see Figs. 5-6). 
The initiator with the adjacent part of the air chamber constitutes the 
bellows, but the remainder of the air chamber provides the pipes or 
keys. Just as a musical instrument does not have to employ every 
key or pipe at the same time, so an air mechanism may for a single 
sound use just a small part of the space of its potential air chambers. 

Any air chamber, even if passive, is a potential air-stream mech- 
anism, since the addition of an initiator would turn it into an active 
air mechanism. The mouth cavity, for example, which is passive dur- 
ing the closure for [k], is a potential air mechanism since it would 
produce some type of click if the tongue should move backward to 
initiate an ingressive oral air stream. 

Three air-stream mechanisms may be considered to be major types, 
as contrasted with several minor ones, since the former can produce 
many kinds of sounds and series of sounds, whereas the latter are 
extremely limited in productive possibilities (although the esophageal 
type is limited less than other minor mechanisms) . 


MAJOR AIR-STREAM MECHANISMS 
Pulmonic Air-Stream Mechanism 

The lungs acting as an initiator may press lung air outward through 
the pharynx and mouth or nose (or some connected combination). 
This may be done in a rapid burst, as after a cough, where the pressure 
is obvious, or very slowly, as for a sustained frictionless continuant 
whose pressure is slight but which may be continued for about half 
a minute. During inspiration the enlarging of this chamber may like- 
wise be rapid or sustained, the time being limited by the capacity 
of the lungs to continue expansion. For a diagrammatic indication 
of the movement of the initiator and the air stream in [t] see Figure 4. 

More sound types can be produced with lung air than with any 
other mechanism. All English phonemic norms use egressive lung 
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air only. Both egressive and ingressive stops, fricatives, frictionless 
spirants, vocoids, trills (most ingressive trills are extremely difficult to 
produce and differ acoustically and structurally from egressive types), 
laterals, and so on , 1 can be produced by this mechanism. 

In order to show the interaction and interdependence of initiators, 
air streams, and certain typical strictures, and the simultaneous action 
of several mechanisms, the following symbols are presented, which 
apply throughout the sample diagrams here and later. The diagrams 
are to be read from left to right. When [h] is given, its purpose is 
to allow demonstration of releases and so on; the sound is assumed 
for convenience to be a type of voiceless vocoid which has the tongue 
flat in the mouth and, therefore, no vocoidal partial stricture. 


1 lips 

b back part of tongue 
v velic 
g glottis 
w wall of lungs 

closure 

fricative stricture 

, , , , frictionless stricture 
.... zero stricture 

vibration 


■ > > > > movement (of air 
stream or initia- 
tor) outward 
movement inward 

L lung air 
P pharynx air 
M mouth air 
c c c c capped compressive 
air stream 

r r r r capped rarefactive 
air stream 


[hph] 

1 

b 

v 

g 

L ->-y c c c c-» 


M 



Ingressive [x] 

1 . . . . 

b 

v 


g . . . . 



Pharyngeal Air-Stream Mechanism 

The larynx with glottis closed (but for voiced types see combina- 
tions with lung air) may be thrust upward, compressing the air 
beyond it. If an egress is provided through the mouth or nose 
pharynx air will rush out. When there is no egress, a pharyngeal 

1 In this chapter these terms will be used with their traditional meanings. 
In Chapter VII certain of them will be redefined in slightly different ways. A 
“ vocoid ” is an articulatory approximation of a vowel ; a “ contoid ” is such an 
approximation for a consonant. For their further definition, and for a con- 
textual redefinition of “ vowel ” and “ consonant,” see pp. 143-45. 
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capped air stream results; these are glottalized stops (the “ ejectives ” 
of British usage, or the “ glottalic pressure stops ” of Catford). The 
larynx may likewise be lowered, rarefying the air in that section of 
the chamber, and setting up an ingressive air stream to fill the gap. 
Stops of this nature are implosives (described by some writers as 
“glottalic clicks,” or “glottalic suction stops”). See Figure 5 for 
a diagram of this mechanism. 




Fig. 5. Pharyngeal air Fig. 6. Oral air mecha- 
mechanism nism 

The long arrows show potential initiator movement; the 
broken lines show potential movement of pharyngeal air 
streams (Fig. 5) and of oral air streams (Fig. 6) 

Sounds with pharynx air include the same general types as with 
lung air, subject to certain changes and limitations. The comparison 
does not cease with stops, but includes fricatives, frictionless spirants, 
vocoids, and the like. Egressive pharynx-air sounds have glottalized, 
timbre; ingressive, implosive timbre. Because of the weakness and 
shortness of duration of the air stream, which even with a fricative 
stricture can be maintained for only a few seconds, the majority of 
pharynx-air frictionless continuants tend to have zero audibility. The 
whistles, frictionless trills, and lip “ voice ” (see p. 126) provide occa- 
sional exceptions. The most frequent exceptions are the percussive- 
transition pharynx-air vocoids (see pp. 93—94, 104, 115), which con- 
stitute the loud “ pop ” due to the front oral release of nonaffricated 
clicks (or even weaker sounds following the release of some close 
fricatives). For the sounds commonly called “glottalized vowels” 
(not made with pharynx air) see the discussion of laryngealization on 
pages 127-28. All sounds with this mechanism are, in the nature 
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of the case, completely voiceless, since voice is a function of the vocal 
cords during the action of the pulmonic mechanism (combinations of 
the two mechanisms will receive attention below, pp. 94-96). Lip 
“ voice ” (see p. 126), however, is clearly audible. Modifications such 
as labialization produce audible effects upon the release of stops and 
upon fricatives; lip rounding as a primary articulation, if it is to be 
audible by itself without other strictures, has to be of the fricative, 
or else the whistle, type. Certain whistles produced by pharynx air 
are readily audible, even if brief. Certain trills (e.g. alveolar) can 
be made with this mechanism, but others (e.g. full bilabial) are dif- 
ficult since they require a heavy air stream. No sound types can be 
made by this mechanism alone which with pulmonic mechanism would 
demand any kind of complete or partial glottal stricture (e.g. glottal 
stop, or whisper), since the glottis is already closed as initiator and 
needs strictures of other functions beyond it for sound production; 
the vibrating glottis in voiced types is not a function of this mech- 
anism. 


Velar glottalized Bilabial implo- Velic glottal- 

stop, plus {_ h] sive fricative ized fricative 

1 1 1 

b ... b . . . . b . . . . 

v v v 

g >>>>>> • • g <<< - < g > - >>> 

w ->-y P -< - < - < - «■ P > - > - > - > 

L -»-> 

P c c c c->-v 


Three further diagrams are given to illustrate types of release of 
a velar glottalized stop: (1) affricated, and with quick glottal release, 
(2) affricated, but with slow glottal release, (3) nonaffricated, but 
followed by a percussive-transition glottalized vocoid and then delayed 
glottal release. Already illustrated above is the type which is non- 
affricated and followed by a percussive-transition glottalized vocoid 
and then quick glottal release. 


k ? x’ A 

1 

b ... 

v 

g >>>>>> • • • 

w ( >>>> ) >> > > 
L (cccc) > > ■ » 
P c c c c ->-> 


k ? x ? ? A 

1 

b 

v 

g >> ■ >>>> • • 

w (- >>>> ) >>>>>> 

L (cccc)c c > > > > 
P c c c c» 


k ? A’ ? A 

1 

b 

v 

g > > - > - > - >> ■ • 

w ( ■ > ■ » > ) >>>>>> 

L ( c c c c ) c c ->- > > > 
P c c c c-w 
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Oral Air-Stream Mechanism 

The back part of the tongue while touching the velum may be 
thrust forward, compressing any mouth air caught between it and 
some stricture farther forward, at the tongue tip or lips or the like; 
if a place of escape is provided, an egressive air stream is produced. 
The back part of the tongue may be moved farther back in the 
mouth while maintaining its velar closure and in this manner rarefy 
the air in the chamber and initiate an ingressive stream to the mouth ; 
the sounds of kissing, or of clucking to horses, are produced in this way. 

This mechanism (see Fig. 6) is used for sucking water up into a 
straw; the rarefaction of the oral cavity allows air pressure to push 
the cheeks in simultaneously. If one has his mouth full of water and 
ejects the liquid forcibly a distance of a few feet, a modified form 
of the egressive oral mechanism is being used. That neither the suc- 
tion nor the pressure is derived from the lungs in these experiments 
can be demonstrated by breathing out from or into the lungs by the 
nose at the same time; the oral cavity is a separate unit. A nursing 
baby uses a type of ingressive oral mechanism, but with certain 
peristaltic modifications in tongue movement which are not pertinent 
to sound production. 

Like the pharyngeal air mechanism, the oral one has small expan- 
sion-contraction powers. Sounds produced by it rarely last more 
than a fraction of a second, though their spirants can be continued 
weakly for a second or two. 

Subject to the following limitations, sounds of the oral air mech- 
anism comprise the same stop, fricative, frictionless spirant, and 
vocoid types as those produced by lung air: sounds with ingressive 
mouth air have click timbre; with egressive air, egressive-click timbre 
(but in contrast to lung-air and pharynx-air sounds, the ingressives 
are the more common in speech). Only sounds produced by oral 
strictures may occur with this mechanism, since the pharyngeal and 
nasal cavities are completely outside it (except following velar re- 
lease) ; nasalized or voiced types are achieved only in combination 
with the pulmonic mechanism. Mouth-air frictionless continuants 
tend to have zero audibility except for whistles, frictionless trills, 
lip “ voice ” (see p. 126), and percussive-transition click vocoids (see 
pp. 91, 104-5, n5), which usually (that is, except for affricated 
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clicks) constitute the loud “ pop ” in the instant following the front 
oral release of a click stop. When the velar closure releases first, at 
the end of a click stop, or forms a fricative partial stricture at the 
same time that it is moving as an initiator, egress or ingress of the air 
stream is by way of the pharynx or nose. 


A kiss 

1 

M r r r 


Bilabial click 
fricative 

1 

b <<< - < - 
M < <<< 


Bilabial egressive-click 
stop, plus [hj 

1 ... 

b >>>>>> • • 

g 

w -v-»- 

L ->-> 

M c c c c 


For types of vocoidal or affricated releases one may draw analogies 
from pharynx-air diagrams (see p. 92). 

Certain differences of tongue position may modify the character 
of the oral air mechanism. Although the air chambers may be quite 
similar, the initiator may at various times comprise larger or smaller 
sections of the wall of the chamber. Parts of the tongue other than 
the back portion may make the closing contact, be raised and fronted, 
backed and lowered, in conjunction with it; the mid section may be 
lowered while the back part is relatively stationary and still produce 
effects similar to the type in which the back part of the tongue is 
moving. 

In fact, a number of very loud and sharp click stops come from 
a type of oral mechanism whose air chamber is zero, since the whole 
surface of the tongue is in close contact with the roof of the mouth. 
The initiator is the lowering, backing, and contracting tongue in its 
entirety (if we ignore the root of the tongue outside the oral cavity), 
except that a back point retains contact on the velum. From this 
type of mechanism no egressive sounds can emerge since within the 
zero air chamber there lies no air to be compressed. 


COMBINATIONS OF MAJOR AIR-STREAM MECHANISMS 

Three major mechanisms have been listed: pulmonic, pharyngeal, 
oral. Since their productive initiators are separately placed, with 
relatively independent muscle systems, they can function simultane- 
ously if the air chambers needed by each can either be kept separate 
or serve in two functions at the same time. The pulmonic mechanism, 
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for example, usually includes the oral cavity within its active air 
chambers; for the oral cavity to become an oral mechanism at the 
same time, therefore, the pulmonic mechanism must either confine 
itself to air chambers which do not include that cavity or else manage 
to use it concomitantly. A statement of possible combinatory types 
follows. 


Pharyngeal Mechanism plus Pulmonic Mechanism 

The pharyngeal mechanism in its normal function requires a closed 
glottis, so that when the larynx is raised or lowered compression or 
rarefaction may follow. If the glottis were wide open no pharyngeal 
compression or rarefaction would take place, and the only modifica- 
tion of the pulmonic sound would be caused by the expansion or con- 
traction of the pharyngeal cavity as part of its air chamber; no air 
stream would be initiated in the pharynx. 

On the other hand, the only pulmonic sound which can be made 
with glottal closure is the glottal stop. This might appear to prohibit 
entirely any combination of the two mechanisms. 

The rarefaction developed by the laryngeal movement, however, 
is slightly more than is needed for the sounds of the pharyngeal 
mechanism. A slight leakage of air from the lungs through the glottis 
may be sufficient to vibrate the vocal cords, while not destroying the 
partial vacuum produced by the lowering larynx which sucks air into 
the mouth and pharynx. This produces the sounds [6], [rf], and the 
like (see Fig. 8), often known as “ implosives.” For a mechanical 
illustration by stream cylinders see Catford in Le Maitre Phonetique, 
3d Series, 65 (1939), 4. 

Since under these conditions the pharyngeal and oral cavities con- 
stitute the air chamber for the pharyngeal mechanism and at the 
same time constitute the outer part of the air chamber for the pul- 
monic mechanism, two descriptions would at first seem to be possible: 
ingressive pharynx-air stops or fricatives could be said to be accom- 
panied by voice or else voiced lung-air stops and fricatives might 
be said to be modified by an ingressive air stream to the pharynx. 
The choice of the first description is made because the glottis function- 
ing as vibrator is given a lower place in the descriptive rank of 
strictural function (see pp. 130-31, 133) than are the oral strictures 
prominent to the implosion. 

Three other types of combination are much more difficult to pro- 
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duce, and sometimes the results are of doubtful validity. Voicing by 
egressive pulmonic air can be added to egressive (instead of ingres- 
sive) pharynx air. This produces voiced glottalized stops and frica- 
tives. Here again the use of “ glottalized ” timbre to designate egres- 
sive pharynx air is different from the general practice; certain se- 
quences of (apparently fortis) glottal stop followed by voiced spirants 
at times combine to constitute single phonemes and are called “ glot- 
talized voiced spirants ” by a number of writers. 2 The combination 
into single phonemes of a glottal stop with other sounds in sequence 
might by the phonemicist be conveniently termed “ sequential glot- 
talization,” in contradistinction to the pharynx-air type. 

An ingressive voiced stop is difficult to produce with lung air. 
This difficulty is amplified when one tries to combine voicing from 
ingressive lung air with either egressive or ingressive pharynx-air 
stops. With the fricatives the difficulty is slightly less, but sufficiently 
formidable, so that I have thus far failed to produce many of the types 
postulated by these analogies. 

The mechanisms can be combined a bit more readily at times if, 
instead of voice, a glottal trill is substituted in each case, whether with 
ingressive or egressive lung air. The reason for this appears to be 
that a glottal trill uses a much smaller air stream than does voice 
and hence does not so quickly destroy the pharyngeal vacuum for 
ingressive pharynx-air types or add excessive pressure for egressive 
ones; correlated with this is the fact that the glottal formation seems 
to approach nearer to a position of closure than for voice (and for the 
same reason decreases the size of the air stream). The relation of 
glottal trill to voice will be discussed briefly in the next chapter 
(pp. 126-28). 

[6h] 
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2 Cf. Sapir, “ Glottalized Continuants in Navaho, Nootka, and Kwakiutl 
(with a Note on Indo-European),” Lang., 14 (1938), 248-74. 
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of [C] {i.e. [?B]) 
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Oral Mechanism plus Pulmonic Mechanism 

During the production of any mouth-air sound lung air of either 
ingressive or egressive type may cause simultaneous phenomena. The 
entire range of ingressive or egressive clicks can be produced under 
these conditions, but the lung-air sounds are limited to types which 
can be completely executed behind a velar closure, using pulmonic, 




Fig. 7. Alveolar click stop with Fig. 8. Voiced alveolar implo- 
voiced nasalization sive stop 

C, compression ; R, rarefaction ; SP, semipassive chamber ; — vibrating 
vocal cords; arrows and dotted lines show the direction of movement 
of initiators and air streams 

The sound represented in Figure 7 precedes an oral release 

pharyngeal, and nasal cavities ; the velar closure may do double duty 
— as initiator for the clicks, but as oral closure for the lung-air sounds. 

Sounds known as nasalized clicks (see Fig. 7) have simultaneous 
[q] ; voiced clicks have simultaneous [g] ; the air stream in this sec- 
ond type is capped at one exit by the velic closure and at the other 
by the whole oral click mechanism, but specifically by its velar closure- 
initiator. Various types of releases can be given; if the click releases 
first, then the velar closure can release as for a normal [g], and so on. 
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The vocal cords do not, of course, need to be vibrating for these 
combinations, so that the pulmonic contribution could be voiceless. 
Since the oral and pulmonic mechanisms have separate initiators and 
may use separate air chambers, their air streams do not have to be 
both egressive or both ingressive at the same time. All four possible 
combinations can readily be given: ingressive lung air plus ingressive 
mouth air; egressive lung air plus egressive mouth air; ingressive lung 
air plus egressive mouth air; egressive lung air plus ingressive 
mouth air. 


Voiced egressive nasalization 
of bilabial click fricative 
1 

b < < < < 
w ... . 


V oiced bilabial/ click 
stop, plus [h] 
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[h], plus voiceless ingressive nasalization 
of bilabial egressive-click stop, plus [h] 
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Oral Mechanism plus Pharyngeal Mechanism 

To any egressive click may be added a velic glottalized fricative 
or a velic-released velar glottalized stop; to any (ingressive) click 
may be added a velic implosive fricative or a velic-released velar 
implosive stop. Apparently the direction of the air streams must be 
the same, either both egressive or both ingressive; this has nothing 
to do with the theoretical possibility of ingressive pharynx air plus 
egressive mouth air, and vice versa, as far as the air chambers are con- 
cerned ; it appears to be simply a limitation of muscular arrangements 
such that it is physiologically difficult to be lowering the larynx while 
fronting the tongue. Perhaps the mechanical difficulty can be over- 
come; if so, the velic glottalized fricative can be made to accompany 
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an ingressive click, and so on. All these sounds are in the nature of 
the case voiceless, since voicing implies pulmonic mechanism for vocal- 
cord vibration. 

Velic-released velar glottalized slop with 
simultaneous bilabial egressive-click 
fricative, plus Df] 
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Oral Mechanism plus Pharyngeal Mechanism 
plus Pulmonic Mechanism 

The sounds described in the preceding paragraph can be voiced 
by having a leakage of lung air vibrate the cords during the move- 
ment of the pharyngeal initiator. The difficulties of voicing during 
ingressive lung air are here accentuated to such an extent by the extra 
mechanisms that of eight possible ingressive-egressive combination 
types I have succeeded in making sounds of one or two only. 

Voiced velic-released velar implosive stop with 
simultaneous bilabial click stop, plus [h] 
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MINOR AIR-STREAM MECHANISMS 

The esophagus can be expanded or contracted as an initiator of 
esophagus air, with sounds both ingressive and egressive. The rem- 
nant of the air chamber is the same as for lung-air sounds except that 
the pulmonic cavity is not included. A modification of this type 
entails use of the stomach, also, as initiator and part of the air cham- 
ber; compressive types may be made by gas. The esophageal mech- 
anism is employed principally for making belches. 
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The esophageal mechanism can be used to produce a large per- 
centage of the sound types that are seen in the pulmonic mechanism. 
Each of the sounds is modified by belch timbre. Many of the sound 
types, however, cannot be distinguished from one another acoustically 
since the belch timbre itself masks out numerous distinctive features 
which could otherwise be observed. The opening into the esophagus 
serves as a vibrator analogous to the vocal cords ; it is difficult if not 
impossible to eliminate its vibration entirely in order to obtain sounds 
perfectly analogous to pulmonic voiceless types. 

Combinations can be made between the esophageal mechanism and 
the oral or pharyngeal one. To obtain descriptions of them one can 
use previous descriptions of combinations of the pulmonic type with 
oral and pharyngeal mechanisms, substituting esophageal mechanism 
and vibration for the pulmonic type with voice. One added combina- 
tion must be mentioned: during the belches air may be escaping from 
the lungs to vibrate the vocal cords and add true voicing to the 
belched stops, fricatives, or vocoids. 

Laryngectomized patients are sometimes taught to use esophagus 
air as a substitute for the lung air which no longer passes through the 
supraglottal cavities. There is some question about the vibrator as 
the voicing substitute in this instance; the claim that the epiglottis 
serves for this is open to question . 3 I have had no opportunity to 
study the voices of such persons directly, but from a sound film 4 it 
appears that any one “ breath group ” is limited to perhaps ten syl- 
lables in the most expert speakers, who use esophagus air perhaps 
exclusively. Less expert speakers use fewer syllables and supplement 
esophagus air with pharynx air for making many voiceless sounds, 
especially stops. 

There is a large group of other minor mechanisms, but none of 
them can produce a great variety of sounds; they are limited to a very 
few stops and fricatives each, with or without lip rounding and with 
other minute modifications. 

3 Cf. Morrison and Fineman (“ Production of Pseudo-voice after Total 
Laryngectomy,” Trans. Am. Acad. Ophthalmol, and Otolaryngol., 41 [1936], 
631) for a statement of various types of vibrators used by these laryngectomized 
patients, Jackson (“ The Voice after Direct Laryngoscopic Operations, Laryngo- 
fissure and Laryngectomy,” Arch, of Otolaryngol., 31 [1940], 35) doubts the 
value of the epiglottis as a vibrator. 

4 See p. 25, n. 97. 
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If the tongue tip is turned down under the tongue in such a way 
that the blade of the tongue touches the lower lip, the separation of 
the tongue (by moving the blade farther back into the mouth) from 
the lip produces a suction sound, an ingressive stop. 

If the lower lip is pulled somewhat into the mouth up over the 
lower teeth and the tip of the tongue is then placed against the lower 



Fig. 9. Minor reversed air mech- 
anism 

A, active air chamber; R, rarefac- 
tion. The arrow shows the direc- 
tion of movement of the initiator 



Fig. 10. Induction mechanism 

G, vibrating vocal cords; IV, in- 
duced vibrator; — induced train 
of sound waves 


teeth and gradually slid up across and off from the lower lip, a dif- 
ferent type of suction sound may be noted (see Fig. 9) ; the mechanism 
is of the reversed type. 

Several varieties of minor mechanisms may be discovered by 
manipulating the tongue in various ways in the bottom of the mouth. 

These minor mechanisms can be combined with a large number 
of lung-air sounds and a few pharynx-air and mouth-air sounds; the 
sounds of the major mechanisms which may combine with these 
mechanisms are in general those which do not need the front part 
of the tongue in their articulation. 

The tongue tip may be used in several other ways, in the upper 
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part of the mouth rather than the lower. It may, for example, be 
placed outside but against the upper teeth, while touching the upper 
lip. If the tongue tip and lip are then opened outward, while the 
blade is moved inward, the blade serves as an initiator for ingressive 
sounds. This mechanism is used frequently to suck the teeth to re- 
move food particles and the like; in this capacity it is also to be 
observed in a modification which places the tongue tip up between the 
teeth and the cheek at the side of the mouth. 

If two closures are made by the tongue, one at the velum and the 
other at the alveolar arch, the movement backward of the back part 
of the tongue causes rarefaction of the air between the two closures. 
But if the tip of the tongue is moved backward it compresses the 
air in that chamber; a release of the tip then allows egressive escape 
of compressed air. Since the initiator is at the front of the chamber, 
the mechanism is of the reversed type. 

When the mouth cavity is almost completely closed by the tongue 
(approximately in position for [t] ), a zero cavity with small bound- 
aries is left between the teeth and cheeks. If the cheeks are drawn 
outward by the fingers they may serve as initiators for rarefactive 
sounds. If the cheeks are first puffed out (with pharynx air or 
mouth air), so that the cavity between the teeth and cheeks is filled 
with air, the cheeks may then by their own power serve as initiators 
of compressive sounds by forcing themselves toward the teeth to drive 
out the air from the cavity. 

Similar sounds can be made by the lips. The oral cavity may be 
closed with the tongue, while the space between the lips and teeth 
is puffed full of air, after which the lips as initiator can be forced in 
toward the teeth to make compressive sounds. Since the initiator 
here is on the outward edge of the air chamber, the mechanism is 
another reversed one. A modification of this mechanism can produce 
a rarefactive sound or two. With the oral cavity closed by the tongue, 
the closed lips are thrust sharply outward and then opened; a slight 
rarefactive sound is heard. The reverse of this may produce a slight 
compressive sound. 

Like other mechanisms, these can enter into various combinations. 
Their sounds can be produced simultaneously with those of pulmonic, 
oral, and pharyngeal major mechanisms, or even with other minor 
mechanisms, provided the various initiators, air chambers, points of 
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articulation, and articulators do not overlap in such a way as to pre- 
vent for mechanical reasons the combination of types already illus- 
trated. 


Percussion Mechanisms 

Just at the time some part of the wall of an air chamber is making 
or breaking a closure it becomes a percussor, if there is no centering or 
outward movement in respect to the chamber during the actual con- 
tact time. Percussors differ from initiators in several ways: in open- 
ing and closing they move perpendicularly to the entrance of the air 
chamber, instead of having a section of the wall of an air chamber 
move for some time toward or away from the center of the chamber ; 
they produce no directional air current, but merely a disturbance that 
starts sound waves which are modified by certain cavity resonators; 
they manifest their releasing or approaching percussive timbre only at 
the moments of the opening and closing of some passage, with no 
possibility of producing protracted compression or rarefaction of air 
and no possibility of continuous sound production. 

When the lips open or close, they are percussors. The tip of the 
tongue making closure on the alveolar arch is one likewise. The velic 
closing the nasal passage, the tongue closing the oral cavity at the 
velum, the vocal cords snapping together for a glottal stop are all 
illustrations of percussive action. Children frequently make a lingual 
percussive by slapping the tongue sharply against the floor of the 
mouth; this is usually done as part of the release movement of a 
lingual click. 5 

Certain percussives tend to be below the threshold of audibility 
unless two conditions are present: (1) the movement must usually be 
rather sharp, forceful, and rapid; (2) the chamber of which they are 
a part must be a completely enclosed body of air, as for a stop. 

The majority of relatively loud types are made by front oral 
percussors when there is a velar closure or when there is both glottal 
and velic closure. This can be seen by making the lips open and close 
in rapid forceful sequence under the following varied conditions: 
first with the glottal and nasal passages open as for a nasalized vowel 

5 Doke (“An outline of the Phonetics of the Language of the Chu: Bush- 
men of North-West Kalahari,” Bantu Studies, 2, No. 3 [1925], 163) reports such 
a click as an abnormality of one speaker. 
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(this usually gives but very slight noise, or no audible noise) ; then 
with velic closure (still almost inaudible, probably entirely so for the 
releasing percussive) ; next with glottal and velic closures (immedi- 
ately audible, for both releasing and approaching percussor) ; finally, 
with oral closures at the velum or farther front (still definitely 
audible). An interesting experiment substantiates the validity of this 
series. Form the same cavities, with the same series of closures; 
instead of closing the lips, snap the cheek sharply with the finger. 
For the type with glottal closure one may obtain similar results by 
snapping the throat instead of the cheek. When the lips are open 
during this snapping the result is similar to the sound of releasing 
percussives; the latter tend to sound briefly vocoidal. If the lips are 
closed the results are more like the closing percussives or like the 
action of a drum when beaten. 

Air chambers which have percussors but no initiators are percus- 
sion mechanisms and have static air rather than an air stream. Any 
passive chamber is a potential percussion mechanism, since it becomes 
an active one if a percussor is added. 

Certain of the percussion mechanisms can enter into combination 
with each other, just as can the air mechanisms. In such groupings, 
however, the oral percussors are usually the only ones audible. If, for 
example, the glottis, velum, velic, and lips are closed, the release of 
the latter two simultaneously usually leaves only the labial one 
audible. 

Percussors and their sounds may be added to all air streams, 
however. The closures and releases of all stops are caused by the 
addition of percussors to air streams. The closures at the start of 
[p], [t], and [k] may be distinctly audible if made sharply enough. 
On the release it is impossible at times to separate the sound of the 
percussor from that of the releasing compressed air. In lung-air 
sounds the releasing percussive is usually inaudible, as we have al- 
ready stated, so one would hardly expect to hear it when the sound 
of the releasing air is produced at the same time. In pharynx-air 
and mouth-air sounds this difficulty is nonexistent, but a new one is 
added: the release of the capped air stream often seems to exaggerate 
the sound of the releasing percussive rather than to make a new sound 
of its own. The release of certain very loud sharp-cut glottalized 
stops gives the acoustic impression of releasing percussives only, 
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except for being louder. (Some percussive- transition vocoids have 
already been discussed; see pp. 91, 93.) 

Percussion mechanisms as a whole can enter into combination with 
air mechanisms. One example of this is seen when velar closure and 
bilabial percussive form an oral percussion mechanism with static air, 
while the lungs, as initiator for a pulmonic air mechanism, thrust air 
out through the nose and vibrating glottis; the opening and closing of 
the lips are clearly audible above the sound of the voiced nasal (but 
compare the induction mechanisms, p. 106). Double stops are some- 
times built up in a way similar to this. The pulmonic mechanism 
gives a pulmonic air stream which vibrates the vocal cords and is 
capped by velic and velar closures; the oral cavity has no air stream 
since the back part of the tongue is stationary, but the cavity is 
turned into a percussion mechanism by the release of the lips. This 
bilabial percussive is added to the sound of the voiced [g] . After the 
labial release the velar closure may release also in the normal manner 
of a releasing voiced velar stop. The grouping of such items into 
single phonemes is a linguistic feature beyond the scope of this dis- 
cussion. 

Bilabial releasing concussion 
of pharyngeal static air 
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Scraping Mechanisms 

Sounds caused by the rubbing of one object against another are 
very rare in vocal types. A few scrapives may be found, however, 
which are produced in this way. The lower teeth being rubbed back 
and forth against the upper ones may produce one such sound. An- 
other can be made by scraping the lower teeth up and down against 
the inside of the lower lip. All such sounds are distinct from fricatives 
(see p. 141), which are caused by friction of air streams. 


Double stop [bg3 , 
plus Of] 
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Induction Mechanisms 

If sound vibrations (from voicing, primarily) are transmitted 
across some closure or wall of a chamber, that section becomes an 
induced vibrator for sounds of static air in an induction mechanism 
(see Fig. io). Such sounds are initiated in an active chamber but 
have their vibrations carried across the chamber wall into a passive 
one. If during the continuous pronunciation of a voiced velar nasal 
the lips are repeatedly opened and partly closed (but not completely 
closed, or percussives will result also; see p. 103), changing induced 
resonances may be detected. If the velic is perfectly closed for the 
production of oral sounds, any vibrations which may be present in 
the nasal cavity are likewise due to an induction mechanism. 

This chapter has shown that most sounds entail the use of some 
type of air stream. Air streams may be either egressive or ingressive. 
Exactly halfway between egressive or ingressive types lie the percus- 
sion mechanisms; if some part of the chamber wall of a percussion 
mechanism begins to move inward or outward, especially the lungs, 
glottal closure, or velar closure, the mechanism immediately becomes 
an active air type. The following chapter will show how the air 
streams initiated in air-stream mechanisms are controlled and mod- 
ified. 
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CONTROLLING MECHANISMS 

Segmentation 

I N CHAPTER III (pp. 42-55) there was discussed the possibility 
of cutting a continuum into phonetic segments which would not be 
phonemes but would reflect a natural segmentation of vocal sound 
production. At this point a statement will be presented delimiting 
such segments, after which the various factors involved will be con- 
sidered separately. 

Statement: A segment is a sound (or lack of sound) having in- 
definite borders but with a center that is produced by a crest or 
trough of stricture during the even motion or pressure of an initiator ; 
in static mechanisms percussives are segmental centers. 

If one says [aiaiaiaia] five troughs and four crests of stricture 
may be observed. A crest of stricture is a point in a continuum where 
the strictures approach more nearly a state of closure than do the 
strictures immediately preceding them in the sequence. The central 
sounds in the following groups of three have crests of stricture: 
[hph], [sth], [aia], A trough of stricture is a point in a sequence 
where the strictures approach more nearly a state of openness than do 
the strictures immediately preceding that point. Examples are the 
central sounds of [php], [eai], [fhs]. 

The addition of any stricture adds a crest segment; illustrations: 
the second sounds in [hp], [nd], [ha]. The subtraction of any 
stricture adds a trough segment, as in [bm], [ea], [th]. The sub- 
stitution of one stricture for another adds a segment, regardless of 
whether or not the second stricture has a greater degree of closure 
than the first or a different function (so long as the same air mech- 
anism is used for each); compare [fs], [pa], [mp]. In a substitu- 
tion, when one stricture is released and another added, the articulator 
which has the higher rank determines whether the segment is a crest 
or a trough. Rank of articulators constitutes a descending series: oral 
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closure, oral fricative stricture, oral frictionless stricture, nasal stric- 
ture, pharyngeal (or glottal) stricture. Crest segments: the second 
sounds in [am], [os], [ap], [mp] ; trough segments: the second 
sounds in [ma], [so], [pa], [pm]. Two strictures approaching 
simultaneously or two releasing simultaneously are double approaches 
and releases, but for segment production they act like single ap- 
proaches and releases. Even though the two strictures of a double 
approach may have different strictural function, the analysis remains 
unchanged. Double releases may be seen in [bM], [eA]. 1 

The function of an initiator is such that its steady movement or 
pressure does not constitute a trough and crest of stricture. In a 
bilabial voiceless click (a kiss), for example, two units may be ob- 
served: the first unit occurs during the labial closure and the second 
immediately after the oral release ; one does not have to add two more, 
one for the point at which the initiator starts moving, and another 
when it ceases. Nevertheless, from the articulatory point of view, 
the movement (or pressure after the capped air stream has equalized 
air pressures) of the initiator allows the voiceless stops to be classed 
as phonetic entities since they include the action of a phonetic mech- 
anism. 

If, however, the initiator does not move steadily, but slows down 
and then speeds up its movement, or decreases and then increases 
pressure, two unit crests will be detected. Thus if one pronounces a 
long continued [a], only one unit is present, but if during that sound 
the lungs give jerky, unsteady pressure (several chest pulses within 
the single qualitative sound), numerous crests of sound will constitute 
segment centers. 

In the application of this segmentation procedure a phonetician 
immediately faces a complicating factor. Within the identical se- 
quences of sound waves from a single group of articulatory movements 
one observer may hear a crest or trough that another investigator does 
not notice. Persons do not hear the same items even though listening 
to the same data. Certain factors contributing to this difference can 
be demonstrated. The longer a crest is maintained, the more readily 
one hears it; the louder the segment, the more likely it is to be ob- 
served. The duration and intensity of sounds influence their percepti- 
bility. A person tends to hear things which are significant to his own 
1 The capital letters represent voiceless sounds. 
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sound system, whereas he may not notice things which are extraneous 
to that system: phonemic background affects one’s perception of 
sound (e.g. a person may fail to note the presence of a glottal stop if 
it is not phonemic in his language). He can also hear items more 
readily if his attention is directed specifically toward them. Phonetic 
training increases the number of sounds which a person tends to notice. 
It does so by a contrastive procedure similar to that of phonemics in 
language. The instructor’s ear and his marking of papers substitute 
for social judgment and reaction. The quality of sounds affects their 
perceptibility; the more two sounds resemble one another and the 
closer their points of articulation and types of productive mechanism, 
the more difficult they are to tell apart (e.g. two types of [t] which 
differ in production only in that one is alveolar and the other dental 
might be distinguished by the first of two observers but not by the 
second, since the sounds are acoustically very similar). The sensi- 
tivity of the mechanism recording the sounds or measuring the move- 
ments will affect the number of segments analyzed in an utterance. 
A deaf person will hear fewer changes in sound and stricture than will 
a person of normal hearing; a normal person can hear fewer sounds 
than can a person with exceptionally keen physiological equipment. 

An instrumental segment is one which would be detected were 
instruments applied to the segmental analysis of a continuum. In 
this respect instruments act like ears of greater or less sensitivity; in- 
struments of different degrees of sensitivity give different results. 
Every minute change of the crest or trough of stricture, resulting 
from the addition or subtraction of a stricture, and the like, produces 
a new instrumental segment even if it is below the threshold of per- 
ception, as frequently happens because of the lack of sensitivity of 
the ear. 

When the instrumentalist has by various experiments measured 
the threshold of perception for strictural changes and established 
norms for the average ear, then real (or audible ) segments can be 
delimited. A real segment is one which the average ear is physio- 
logically capable of perceiving (i.e. after phonetic training, when 
its attention is directed to the segment, and when no phonemic factors 
interfere). 

Even if the real segments in an utterance could be determined by 
controlled experimental conditions, phoneticians would not hear the 
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same number of real segments under normal conditions of transcrip- 
tion. Differences in attention given to particular parts of a sequence, 
phonetic training, and phonemic background (especially the influence 
of one’s native language) would affect the transcription. The actual 
outcome, therefore, is the marking off of perceptual segments. These 
are the results obtained by a phonetician who applies under normal 
field conditions the methodological statement already given. Just 
how near perceptual segments approach real segments must await an 
instrumental study, but the correlation apparently is quite close for 
trained phoneticians — close enough for perceptual segmentation to 
become an adequate working basis for the average situation in which 
phonetics is utilized. On the other hand, instrumental segmentation 
is frequently below the threshold for real segments; it can hinder the 
discovery of phonemic patterns if applied directly to the search for 
the phonemes themselves rather than to a description of the acoustic 
basis of perceptual segments and phonemes independently ascertained 
by linguistic technics. 

Glides present a related problem. An isolated sound may have no 
glide at all and still be a level segment if there is no movement what- 
ever of the vocal apparatus apart from that of some initiator; [s] and 
[f] might at times be level units. Glides must be present between 
segments since any approach or release involves one. The audibility 
of these glides varies greatly and is dependent upon much the same 
factors as those which influence the perceptibility of segments. The 
longer the duration of a glide, the slower its rate of movement, the 
greater the qualitative change, the greater the sensitivity of the ear, 
and the more a glide resembles some phonemic feature of the observ- 
er’s language or some item included in his phonetic training, the 
greater become the possibility and probability of his hearing it. In 
[ia], for example, the crest and trough of stricture (the [i] and [a] 
respectively) may be made very long in contrast to the glide between 
them ; in that case the glide will be practically or completely inaudible. 
If, however, the initial crest is of very brief duration, whereas the 
glide following it is relatively long, the glide will be heard by speakers 
of English both because of its duration and because of its significance 
to the consonantal phonemic pattern of English (producing [ya] ). 

A single-direction glide which starts at a certain position and ap- 
proaches to reach a crest, or releases to reach a trough, has two seg- 
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ments ; whether these two segmental centers are perceptual or real or 
instrumental will depend upon factors already discussed. A two- 
direction glide has at least three segments: if the stricture first moves 
closer and then opens, two troughs are present with one crest of stric- 
ture between them, but if the stricture first opens and then closes, a 
crest occurs between two troughs. Single-direction glides include [ai] , 
[ia], [hp], [ph], [st], and the like; two-direction glides include 
[yae], [aia], [iai], and so on. 

When the releasing (single-direction) glide of one stricture is 
simultaneous with the approaching glide of another, a crossing glide 
(see p. 1 14) is developed. If the approach be diagrammed by / and 
the release by \, then the crossing glide may be diagrammed as X > 
the central point represents a trough segment which will be perceptible 
in proportion to the status of the variables already mentioned for 
glides and perceptual segments. One such audible crossing glide is 
the sound of open transition between two voiceless or voiced stops; 
in [apta] a type of crossing-glide aspiration may be heard if the lips 
begin to open while the tongue is moving toward the alveolar arch 
(but if the first glide is completed before the second is begun, a dif- 
ferent type of aspiration follows, a simple [h] — a trough segment, 
but not a crossing glide). Voiced open transition between [b] and 
[d] may give similar results. The crossing glide between [m] and 
[p] is not audible; the stopping of vocal-cord vibration and the ap- 
proach for the velic closure seem to be too abrupt to allow for percepti- 
bility of the glide. Presumably differences in strictural function will 
at times affect the perceptibility of one or the other of the two elements 
in crossing glides, and will affect the perceptual segmental interpreta- 
tion accordingly. 

In the same way that an initiator must have steady movement 
(see p. 108) or else cause extra segments, so also must other strictures 
with different functions. If during any single-direction glide the 
articulator goes out of a relatively direct line to its goal, the point of 
greatest divergence from a direct norm constitutes a bulge of stricture 
which causes a segmental crest; see Figure 1 1. Thus, if during a glide 
from [i] to [a] the tongue sweeps backward to a mid-back position, a 
crest is formed between the trough for [a] and the crest for [i] which 
is qualitatively an unrounded back vocoid. 

A time bulge is formed when during a glide the speed of move- 
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ment is suddenly reduced (and often, but not necessarily, restored 
after a short interval); see Figure 12. For example, if in gliding 
from [a] to [i] the speed of opening is reduced as the tongue ap- 


b 



Fig. 11. Bulge of stricture 


A-B, directional movement of approaching articulator; A, 
initial position; B, final position; a, trough; b, bulge; 
c, crest 

proaches a mid position, a mid-front vocoid crest results. A similar 
time bulge may be seen in the pitch of a single vocoid. If the vocoid 
is begun at ‘ do ’ and the pitch steadily (rapidly or slowly) rises 
toward 1 sol,’ two segments would be observed, first a trough (at the 



Fig. 12. Time bulge 

A-B, movement of an approaching articulator; A, initial 
position; B, final position; a, trough; b, time bulge; 
c, crest 

beginning) and then a crest (at the end); if during the rising glide 
the speed of the change of pitch should be reduced at a median point, 
a crest might be observed approximately at ‘ mi.’ 

When one hears a crest of sound during a release, it is sometimes 
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difficult to determine whether the bulge is caused by change of speed 
or by change of the direction of movement. In a shift from [p] to 
[h] a quick release allows no perceptible sound between. A very 
slow steady release makes possible the perception of a gradual change 
of types of bilabial spirant, but no bulge crest. If the speed is slowed 
down, so that there seems to be a slight pause or hesitation of move- 
ment during this release, then one hears a crest which is some type of 
bilabial voiceless fricative [0]. The process of release of the lips 
first and then, later, the lowering of the jaw, also adds a segment 
between [p] and [h], because each of the two releasing movements 
produces a segment. 

The change from [t] to [h] presents a complication. Similar 
transition sounds may be observed in [th] as in [ph] (no intermediate 
sound audible, or steady nonsegmental audible glide, or glide crest if 
movement is slowed down to allow an alveolar fricative of some sort). 
None of these are [tsh]. If [ts] starts from identically the same 
position as does [t] without the sibilant, then the [s] might be con- 
sidered to be formed by a bulge crest of the type which has nondirect 
glide (since the direct glide does not produce [s] even when slow) ; 
but it seems preferable to consider [s] a crest formed by a substituted 
stricture, since the section of the tongue back of the tip has to rise 
slightly to take its sibilant position while the tongue tip drops. If, on 
the other hand, the [t] starts with the blade in the exact position to 
anticipate the position for the [s], there will be no rise of the blade 
to constitute a crest. In this case one can state that the tip and blade 
strictures form the crest for the first segment; the tip releases to 
leave only the blade stricture for the second crest, and then the blade 
in turn releases to the trough segment [h] . This appears preferable 
to describing the movement as the release of the tip by an indirect 
route which gives a bulge heard as [s] . When one starts from a [t] 
which anticipates the blade position for [s] but releases both blade 
and tip as a whole, a fricative different in quality from any so far 
mentioned can be heard as a time bulge if the speed is slowed down ; 
this fricative is quite unlike the sibilant [s] after [t] . If this whole 
blade section and tip are swept backward into the mouth as they 
descend gradually, an entirely different medial fricative crest is ob- 
tained by the nondirect movement. 

It proves convenient in analyzing a sequence of segments to have 
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names for certain types of groups. A presequential is composed of two 
segments, the second of which is, in articulatory positions, identical 
with the first except for one added stricture (e.g. [ hp] , [ai] ) . A post- 
sequential is the reverse: a release of one stricture of the first segment 
produces the second (e.g. [ts], [bp], [dn]). In a prepostsequential 
the second segment is formed by an approaching stricture and the 
third by the release of a stricture (e.g. [aie], [sth], [abm] ) ; a post- 
presequential is the reverse of this type. If simultaneous double re- 
leases (or approaches) are used the same terms may be applied, since 
in segmental delineation one is interested in locating crests and troughs 
of stricture, not in describing the nature of the segments. An illustra- 
tion of a presequential using a double approach would be [hb J , in 
which vocal cords and lips cause strictures at the same time. In a 
prepresequential two consecutive (not simultaneous) approaches form 
the second and third segments (as in [hpb] ) ; a postpostsequential 
entails three segments with two consecutive releases producing the 
second and third segments (e.g. [eah]). A substitution sequence is 
composed of two crests with a crossing glide between (e.g. [pt] with 
rapid, but open, transition; cf., also, [eu]). 

When, in a particular language, two segments combine into a 
single phoneme, the two segments chosen, if consonantal, are almost 
always in postsequential sequence; the most prominent exceptions 
are presequential sequences of nasal plus stop, like [nd]. Vocoid 
types may have diphthongs, like [ai] , which are presequentials. Occa- 
sionally that which is perceptually a single segment may be two 
phonemes; this is frequently true of long contoids or vocoids. If 
voiceless stops with close transition have nothing audible during the 
closure, they also may be single perceptual segments working as two 
phonemes. Often, however, close transition of [pt] is not a single real 
segment, since the release of the lips gives a tiny audible percussive 
sound whose resonance chamber is the small static one in front of the 
lingual closure. In any case, this release of a stricture and the previ- 
ous closure of the lingual one would add instrumental segments, if 
not perceptual ones. 

When some percussor causes sound vibrations with a static cham- 
ber, that sound is considered to be a segmental center. Thus the 
sudden closing and opening of the lips (while a velar closure is held, 
in order to provide static mouth air) would give repeated segments. 
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When the sounds from the percussors are superimposed upon the 
sound of an air mechanism (as with [p] , [t] , [k] ) , it does not prove 
convenient to analyze them as separate segments. The moment of 
reinforcement of the sound of an air mechanism by a percussive may 
be called a percussive transition from one segment to the next (see pp. 
9i, 93-94, 104-5). 

After the crests and troughs in a sequence or a large number of 
sequences have been marked off to show the centers of the segments, 
each of the segments may be classified according to the type of mech- 
anism producing it and the articulatory modifications which are added 
to its air stream. Whenever two segments, whether contiguous or 
noncontiguous, are produced by the same articulatory method and are 
acoustically the same, they are the same phonetic unit, or phone. An 
instrumental phone is a phone identified or identifiable by some instru- 
mental means; repeated contiguous or noncontiguous utterances of 
the same instrumental phone will (by definition) be found identical, 
within the range of sensitivity of some particular instrument. A real 
phone is one which the average normal ear, after training, elimination 
of phonemic prejudice, and so on, would identify, or be physiologi- 
cally capable of identifying ; in repetitions of a real phone any varia- 
tion detectable only by instruments is below the threshold of 
perceptual ability of the ear. A perceptual phone is one which a 
particular ear at a particular time believes it identifies; repeated 
utterances which are to a particular observer occurrences of the same 
perceptual phone, may to someone else be different perceptual phones ; 
and even to the same observer at a different time (especially after 
phonetic training) they may “ sound different,” that is, be different 
perceptual phones for him also. 

The perceptual phones of the phonetic layman always contain 
many more real and instrumental phones than the perceptual phones 
of the trained individual. A set of real phones might line up into 
more than one set of perceptual phones in different phonetic contexts. 
Before training, a speaker of English is likely not to notice varieties 
of his [k] phoneme; simply because they are members of a phoneme 
they would tend to be members of a perceptual phone. After phonetic 
training he may learn to recognize differences due to position of the 
closure relatively far front or back in the mouth, and so on, and real 
phones then become different perceptual phones. The problem of 
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identifying members of phones is largely one of comparing the acoustic 
quality of sounds and the articulatory method of their production. 
This is not true of segmentation, which ignores quality as such and is 
concerned only with the fact that the quality has changed because 
of strictural movements. 

Perceptual and real segments in a specific continuum overlap so 
as to be almost identical; perceptual and real phones in the same 
continuum overlap much less. Languages agree very well on the 
places at which segmentation shall occur (except for the fluctuant 
points; see p. 46), but they have an infinite variety of phones of 
minute qualitative differences. Probably instrumental segments 
would be found not to differ very strikingly from real segments ; one 
might almost say that instrumental phones for a very fine instrument 
must consist of unique members (since no two sounds tend to be 
exactly alike) ; they therefore differ markedly from real phones and 
tremendously from perceptual ones, since the latter are composed of 
many instrumentally different members. One can speak of a natural 
segmentation, since the breaks seem to be quite similar in all lan- 
guages, but one may not speak of a natural choice of phones for 
languages, since languages employ a great variety. 

A speech sound is, generally speaking, that part of an utterance 
abstracted from speech which consists of no less than one segment and 
no more segments (rarely more than two, perhaps never more than 
four or even three) than may be combined into single phonemes. 

Any part of a continuum, chosen for articulatory analysis, which 
contains less than a segment is a phonetic fraction (see p. 52 ) . These 
fractions may be described according to their productive mechanisms, 
but they lack a place in the functioning units of sounds in speech. 

A sound may be either a speech sound, a phone, or a phonetic 
fraction; the term is ambiguous but convenient. 

A syllable is a single unit of movement of the lung initiator (but 
for syllables from other initiators see p. 118) which includes but one 
crest of speed (cf. p. 108). Every occurrence of an initiator time 
bulge (see pp. m-13 and Fig. 12, p. 112) followed by renewed speed 
of the initiator movement is a trough or border between two syllables. 
Physiologically, syllables may also be called chest pulses (see pp. 
53-54). Instruments detecting these units locate instrumental syl- 
lables. Real syllables are those which the ear is physiologically 
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capable of distinguishing. Perceptual syllables are those which an 
investigator actually notices at some particular time. 

The segment during which the speed of the initiator movement is 
the greatest in the syllable is the syllabic; a few exceptions (e.g. in 
[ha]) seem to be due to an acoustic subcriterion of prominence (see 
p. 1 18). All other segments in the syllable are nonsyllabics. The 
syllabic may be considered the functional center, nucleus, or crest of 
the syllable. It is not always possible to determine where the trough 
of initiating movement which separates two syllables occurs; should 
the trough fall within a long contoid or vocoid, the sound is composed 
of two segments. 

Syllable units and segment units are somewhat interdependent. 
Every close stricture within a continuum tends to prevent the egress 
of the air stream initiated by the pulmonic movement, and air pres- 
sure is built up behind these complete or partial strictures. The 
pressure accumulated in this way tends to slow down the movement 
of the initiator or to stop it entirely. When the strictures are released 
the pressure is lessened, and the speed of the initiator movement in- 
creases. These alternations of initiator movement constitute the syl- 
lable pulse. The movement of the initiator can also be slowed down 
by its own sets of muscles, rather than by the resistance of air 
pressure ; this type of time-bulge alternation fuses into a single system 
with that produced because of changes in the air pressure. The 
majority of syllables are caused by changes of strictural interference 
with the air stream and initiator movement; but, on the other hand, 
the strictures by themselves could give rise to very little sound (except 
for the percussives) if the initiator sent no air stream to cause vibra- 
tions at those points. 

Strictures can be produced in isolation with a single movement 
of the initiator and without syllable pulsations. For example, an 
[Is] can be made and continued with no perceptible syllable divisions 
within it. On the other hand, syllable pulsations can be produced 
in an isolated sound, e.g. [a] , without the aid of strictural changes. 
The systems of articulation and syllable movement can therefore be 
operated somewhat independently. For this reason also they can be 
operated simultaneously, superimposing their features one upon an- 
other. The description of the articulatory and acoustic character- 
istics of a phone must therefore be given without reference to its place 
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in any syllable, since its syllable characteristics are subject to change 
depending upon the particular kind of syllable it happens to be in, 
and upon whether or not the phone functions as a syllabic. Its syl- 
lable function can later be stated independently as something super- 
imposed upon its innate articulatory characteristics. 

The articulatory features of the syllable are of course reflected in 
acoustic quality and in the articulatory nature of the syllabic. Since 
the trough of the syllable division tends to come at the region of 
heaviest air pressure, which is in turn partly conditioned by the degree 
to which strictures close off the air stream, segments which have the 
higher degrees of stricture and interrupt the air stream the most tend 
to be nonsyllabic; the nonsyllabic group, therefore, includes stops 
and fricatives more often than vocoids, although voiceless vocoids are 
frequent exceptions. Since sonority is to some extent conditioned by 
the size of the air chamber and by the lack of interference with the 
air stream, the contoid group also tends to be less sonorous than do 
the syllabics. Loudness is conditioned by the degree and type of 
interfering stricture within the structure of the segment and by the 
speed and pressure of the initiator; both of these factors tend to make 
syllabics louder than nonsyllabics. If sounds are uttered very rapidly 
together the ear tends to hear fewer perceptual syllables, since it 
cannot pick up the changes in the speed of the initiator movement 
even though they may be there ; if sounds are given more slowly, and 
the duration of the retention of their strictures is increased, the ear 
is better able to pick up the changes in the initiator movement, and 
therefore more perceptual syllables are acoustically recorded. Fea- 
tures of loudness, sonority, and duration tend to make segments prom- 
inent; since syllabics generally have these features, prominence is 
likely to be the major acoustic characteristic of a syllabic. In pho- 
netic analysis, therefore, one legitimately interprets alternations of 
prominence in terms of alternations of perceptual syllables. 

So far in this discussion the only initiator which has been con- 
sidered in syllable division has been the pulmonic one. If, however, 
sequences of sounds are made with any other initiator without the 
admixture of sounds produced from a pulmonic air stream, the syllabic 
sequences are produced by alternations of pulsations of that initiator. 
If, for example, the breath is held (to insure the quiescence of the 
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pulmonic initiator) and a series of sounds is made with pharynx air, 
a sequence of syllables will be noted which are conditioned by the 
pulse of the laryngeal movement. 

Whenever sounds produced from nonpulmonic initiators are in- 
cluded in a continuum which has sounds with lung air, the syllable 
pulse is given only by the pulmonic initiator, whereas the segments 
from the other initiators merely produce crests and troughs of stric- 
ture for the pulmonic sequence. A glottalized [t] , for example, acts 
like any other type of [t] in syllable alternation if it is surrounded by 
lung-air vocoids. 

A syllable is checked (or closed) whenever it ends in a contoid, 
but especially when the initiator movement is slowed down because 
of pressure of the air stream built up by resistance at an arresting 
segment or at groups of arresting segments functioning together. A 
syllable is free (or open) when it ends in a vocoid. The word 1 hast ’ 
is a syllable checked by the arresting group [st] . The word ‘ spa ’ is 
a free syllable. A trigger segment (or group of segments acting as 
a whole) is one during which pressure is building up behind the stric- 
tures following a syllable trough and just before a syllable crest, in 
preparation for the latter; upon release of the segment the initiator 
movement speeds up because of the smaller degree of resisting pressure 
during the syllabic. The initial contoids in the following words are 
trigger segments: ‘ buy,’ ‘ sell,’ ‘ steal.’ The medial contoids in the 
following words are trigger segments: ‘ receive,’ ‘ pronoun.’ 

A stress group is a sequence of several syllables one of which, the 
stressed syllable, has a much stronger initiator pressure than do the 
others. This is not the simple result of pressure built up behind 
trigger segments. The location of the stressed syllable within the 
stress group is not determined by any articulatory features (and for 
that reason may be determined by the pattern of a particular linguistic 
system). 

The smallest natural units of sound in a phonetic continuum are 
the segments ; these are determined mostly by sequences of strictural 
movement. A larger unit of sound is the syllable, which is partly 
determined by the nature of the segment strictures and partly free 
in that the initiator itself may make time bulges by its movement. 
The largest unit is the stress group, whose size phonetically is almost 
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completely free from determination by the segment strictures. For 
each type of unit the centers tend to be definitely marked, but the 
borders tend to be vague. 


Articulation 

ARTICULATORS 

At the time in the production of some sound when any moveable 
part of the vocal apparatus causes any stricture (the partial or com- 
plete closure of an air passage) it becomes an articulator unless it is a 



Fig. 13. Potential articulators and points of articulation 

Reading from left to right along the top of the mouth: l, lips; t, teeth ; a, alveolar 
arch; p, palate; v, velum; u uvula; reading from left to right along the 
bottom of the mouth: l, lips, t, teeth; tt, tongue tip; tbl, tongue blade; tm, 
tongue middle; tb, tongue back; tr, tongue root; e, epiglottis; reading from 
top to bottom in the nasal and pharyngeal cavities: ve, velic; f, faucal pillars; 
es, esophagus ; jv, false vocal cords ; vc, vocal cords ; ac, arytenoid cartilages 
Note — -f, fv, vc, and ac are not visible in the diagram. For a diagram of the 
vocal cords see Passy, The Sounds of the French Language, facing page 7. 
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closure performing the function of an initiator. All moveable parts of 
the vocal apparatus are potential articulators, since they may become 
active ones by fulfilling the conditions just mentioned. Potential 
articulators comprise the lips (separate or together), the lower jaw 
(either in the approximation of the lower to the upper teeth or in mak- 
ing smaller the oral cavity by closing the mouth), the tongue (which 
is divided into several articulators, since various parts may cause 
strictures separately; tip, blade, middle, back, and root are con- 
venient arbitrary points of reference for these positions), the velum, 
the uvula, the velic (see p. 58), the jaucal pillars (including with their 
articulation certain other factors yet unanalyzed, such as a type of 
lower pharyngeal constriction, glottal tension, and usually a raising 
of the larynx), the epiglottis, the pharynx (the root of the tongue 
moving backward toward some part of the pharyngeal wall, or the 
sides of the passage contracting, or both operations taking place at 
once), the vocal cords (sometimes known more graphically as “ vocal 
folds ”), the arytenoid cartilages, and the esophageal wall at its open- 
ing into the pharynx; the walls of the lungs and the esophagus or 
stomach act as initiators but not articulators. The false vocal cords 
may be potential articulators; at the present time, however, the writer 
has not gained any perceptual mastery over them. Presumably they 
act concomitantly with pharyngeal movement of some type, and are 
grouped with such movement by the imitation-label technic. This 
is largely true of movement of the epiglottis also. 

Most of these items are described in practically every phonetic 
book, so that no elaborate descriptive statement is needed here. For 
a diagram labeling these articulators see Figure 13. 

SHAPES OF ARTICULATORS 

A cross section of an articulator from one side to the other may 
show that for a particular sound it is relatively flat, as for an inter- 
dental fricative [0] ; grooved, as for a sibilant, [s] ; rounded, when 
both upper and lower articulators are grooved to make a relatively 
round aperture; convex; contracted, for one type of [r] which is not 
retroflexed; expanded, as for an [i] in which the sides of the tongue 
push out slightly between the teeth. 

From front to back an articulator may be straight; cupped (though 
it is generally more convenient to consider the front and back parts 
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of the tongue to be separate articulators when cupped, as for a velar- 
ized [t] ) ; retro flexed (used only for the tongue; the tip is raised, as 
for a domal [n] ) ; humped, as for [i] ; retracted, as for a pharyngeal 
stop; protruded, as for interdental [t] ; central, with front escape of 
air, as for [s] ; lateral, with side escape of air, as for [1] . 

An articulator may combine shape from side to side with shape 
from front to back. Thus in an interdental [s] the tongue is pro- 
truded and grooved, in contrast with the same articulator in inter- 
dental [0], where it is protruded and flat, or in contrast to an inter- 
dental labial (lower lip between the two sets of teeth, not just against 
the upper ones) , where a different articulator is used, one whose shape 
is retracted and flat. Any part of the wall of an air chamber, whether 
initiator, percussor, or articulator, which is elsewhere than in a per- 
pendicular line to its normal position at rest, is displaced; this includes 
all protruded or retracted articulators. When one articulator is dis- 
placed it may involve the displacement of a second, and prevent its 
normal functioning. When the tongue root, for example, is articu- 
lating near or against the back pharyngeal wall, normal front vocoidal 
positions cannot be taken by the front of the tongue. 

In addition to the shapes already mentioned there can be made 
various types of distortion of some articulator which may produce 
slightly different nonspeech sounds. One of these is the tongue twisted 
lengthwise so that the tip is upside down, for an interdental stop. 

POINTS OF ARTICULATION 

Any section of the wall of an air chamber which is accessible to the 
contact or near contact of potential articulators is a potential point 
(or region) of articulation, and becomes an active one when any stric- 
ture is made at that point. Certain points of articulation are station- 
ary parts of the vocal mechanism: teeth, alveolar arch (which might 
with more justice be called the gingival one, since the contact is made 
against the gum, not the bone ; the more traditional term is retained 
here for convenience, and applies to the upper gum only) , and palate 
(used here for the hard palate only, not the soft palate, which is 
labeled “ velum ”). See Figure 13. 

When two articulators together make contract or partial stricture, 
this also is called a point of articulation. The two lips working to- 
gether, the two vocal cords in joint operation, the back part of the 
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tongue in conjunction with the velum, and the like, are points of 
articulation. 

If there were no displaced articulators every articulator would 
have one, and only one, point of articulation. A few terms would then 
serve to describe all the combinations. A number of labels are con- 
veniently used under the assumption of a normal position for an 
articulator in its normal point of articulation. These combinations 
are: bilabial (both lips), alveolar (tongue tip and gum), palatal 
(tongue middle and hard palate), velar (tongue back and velum), 
uvular (uvula and tongue back), velic (velic and wall of naso- 
pharynx), pharyngeal (pharyngeal wall and tongue root), faucal 
(faucal pillars), epiglottal (epiglottis and pharyngeal wall), glottal 
(both vocal cords), and esophageal (wall of the orifice of the esopha- 
gus and wall of the pharynx). Terms might be given which show 
the dual nature of such articulation. If the articulator furnishes the 
first item, the point of articulation the second, some of those terms 
could be “ labiolabial,” “ linguoalveolar,” “ linguovelar,” “ epiglotto- 
pharyngeal,” “ glottoglottal,” and so on. This hardly seems neces- 
sary, provided the assumptions are clearly stated which affect the 
labels first presented. 

When articulators are displaced, some new way must be found to 
name their relation to a point of articulation. The type of dual label 
suggested but discarded for the group above is conveniently used for 
the displaced set: the labiodental position (lower lip, upper teeth) is 
an example. Terms like “ labiovelar ” do not mean an articulator 
plus a point of articulation, but indicate a velar sound with labial 
modification; in the terminology for this study that sound would be 
called a “ labialized velar ” in order to avoid the difficulty. 

Such double terms could well be extended to other sounds. Tradi- 
tional terminology, however, tends to use labels which assume the 
proper relationship of articulator and point of articulation without 
stating both. For this reason an interdental sound is one in which the 
tip of the tongue is placed between the upper and lower teeth (not a 
sound made by both sets of teeth touching one another without lingual 
influence) ; a dental sound is one in which the tongue tip touches the 
backs of the teeth, usually the upper ones; a domal sound (i.e. cere- 
bral, or cacuminal) implies that the tongue tip articulates somewhere 
behind the alveolar arch; a retroflex sound (the term is used more 
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or less synonymously with “ domal ”) is like a domal sound except 
that vocoids are modified by retroflexion, not by domal articulation. 

This list does not exhaust the possible points of articulation or 
their relation to possible articulators. Theoretically a section on the 
hard palate or the velum, or the like, could be divided up into an 
infinite number of points at which an infinite number of parts of the 
tongue could in turn produce an unlimited number of sounds. Grant- 
ing that to be true, the total number must then be reduced to a rela- 
tively small group because of the perceptual factor. An untrained 
person could identify only a few of these sounds if they were uttered 
a short time apart; if they were given one right after the other, he 
could distinguish many more, but still a very limited number. For 
this reason it cannot be acceptable phonetic procedure to act as if 
the number were infinite. Much preferable is actual phonetic prac- 
tice, which arbitrarily fixes a few norms for points of articulation 
and handles any perceptible deviation from those norms in terms of 
departures from them. 

The choice of particular points on the velum and palate as norms 
for the place of production of sound types is highly arbitrary, since 
these points and others which might have been chosen give sounds 
which have similar productive mechanisms and acoustic properties. 
The choice of norms is not so arbitrary when it establishes divisions 
between sounds made with the lips, the tip of the tongue, and the 
middle of the tongue. Sounds made with the lips are productively 
and acoustically quite different from types made with the tip of the 
tongue in normal position, and both are different from types made 
with the middle of the tongue, or the glottis, and so on (although just 
at the border between them they may shade into one another). For 
this reason the traditional grouping of stops into types of [p], [It], 
[k] is quite justifiable. 

TYPES OF ARTICULATION 

Normal articulation is that employed for the vast majority of 
sounds (e.g. [i], [s], [p] ) ; in it an articulator approaches its point 
of articulation and, having reached a certain stage of stricture, may 
maintain that position for a longer or a shorter period of time before 
releasing. 

In flap articulation the articulator gives one rapid tap against its 
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articulating region and then immediately releases; approach and re- 
lease together are formed by a single ballistic movement. A flap 
differs from normal articulation in that the stricture cannot be retained 
longer than is necessary for the quick flipping contact. Acoustically 
the percussive is very prominent in comparison with the air stream. 
Most flaps completely close the passageway of some cavity. Samples 
of flap articulation can be seen in the central contoids of American 
English ‘ matter,’ or Spanish pero, ‘ but.’ Laterals may have the 
central contact given and released as a quick flap. One hears but 
rarely of any flaps except those with the tongue tip articulating against 
the alveolar arch, but others with the same type of quick percussive 
effect are possible. A labioalveolar flap in which the lower lip taps 
against the upper gum is one of these. 

Iterative articulation is formed by the repeated, rapid, and auto- 
matic approach and release of some stricture. Three general types 
can be mentioned: chatters, trills, and vibratory trills. 

Chatters are the repeated rapid partial formation of a stricture; 
at their segment crests they do not completely close the air passage. 
When the teeth chatter, for instance, they may make contact and 
cause percussive sounds, but they do not prevent a continuous simul- 
taneous egressive or ingressive air stream which escapes through 
their interstices, even during the contact of their biting surfaces, and 
causes a fricative chatter. The jaw may flutter rapidly up and down, 
causing neither dental contact nor fricative partial stricture, and 
produce chattered vocoids. The abdominal muscles may give rapid 
pulsations and cause syllabically chattered continuants of any type 
within the oral, nasal, and pharyngeal cavities (note some types of 
laughter). 

Trills are iterative strictures of a kind which completely close the 
air passage within which they occur ; within a trill the separate pulsa- 
tions or contacts are audible. Trills may be considered automatic 
repetitions of flap articulation. They may be made by the lips, cheek, 
tongue tip (as in Spanish), side of the tongue (these lateral trills are 
usually quite fricative), uvula (as in the commoner pronunciation of 
French or German), velic (in a snore; usually with ingressive lung 
air through both oral and nasal cavities), esophageal opening (in belch 
sounds; often the separate pulsations are not clearly audible, so that 
the trill approaches the vibratory type), and vocal cords (trills so 
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made are often heard in the speech of people who are talking in a 
tone of voice very low as compared with normal style; in this case 
the glottal trill is substituted for voice; children often make the sound 
for self-amusement ; the attainment of the sound made by “ singing 
two notes lower than you can ” tends to give one pitch of the trill) . 
Sounds which I had for some time been tentatively suggesting were 
epiglottal trills have proved to be pharyngeal, since while making 
the sounds one can touch with his finger the stationary tip of the 
epiglottis (I am indebted to Dr. Fritz Frauchiger, of the University 
of Oklahoma, for aid in this discovery) ; a view through a dentist’s 
mirror shows the pharynx immediately above the epiglottis contracted 
to the size of a lead pencil, while the soft walls of the pharynx at that 
point are flapping, with saliva completing the closure perhaps some- 
what as water is used in a gargle. 

Vibratory trills automatically, repeatedly, and completely close 
an air passage, but the separate percussive sounds are never clearly 
audible, and usually are not audible at all. Such trills may be pro- 
duced at the lips most easily if a small passageway is provided by 
continued firm contact at one or both sides of the lips; a mechanism 
related to this seems to be the type used by players of brass instru- 
ments (for example, a cornet) to initiate the sound vibrations of lip 
“ voice ” which enter the metal resonators. In a similar way, voice 
is a vibratory trill produced by the vocal cords and modified by 
pharyngeal, nasal, and oral mechanisms. 

The exact relation between trills and vibratory trills cannot be 
analyzed by the auditory articulation technic since the ear cannot 
catch the separate factors in vibration. One interesting analogy can 
be mentioned; if a stick is drawn slowly along a picket fence, the 
separate percussions are obvious; if the stick is drawn very rapidly, 
the percussions tend to merge into a steady tone. 

This analogy will not explain the relationship between voice and 
trillization (glottal trill modified by pharyngeal, nasal, and oral cav- 
ities). The second cannot be simply a slow form of the first. The 
following data apply to the sounds as made by the writer, but may 
have to be modified for other speakers. 

If voice is produced with some accompanying oral formation and 
then the sound is whispered or breathed with as little change of glottal 
position as possible apart from the unvoicing, a markedly different 
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kind of whisper is attained in comparison with that which results from 
the same procedure if one starts with trillization, which cuts off 
a large percentage of the escaping air. This parallels the fact that 
an alveolar sibilant [z] with voice is plainly heard, but the same oral 
formation with trillization results in little or no sibilant (unless the 
stricture is made smaller) because of the weak air stream. Kinesthet- 
ically, the author judges that in passing from voice to trillization 
the posterior section of the glottis is closed; in passing from trilliza- 
tion to voice a relaxation takes place which allows the posterior sec- 
tion to vibrate, rather than the anterior section only. This statement 
has not been checked satisfactorily with instrumental research by the 
author, and quite probably will be subject to modification. 

Another major evidence that trillization is not slow voice is given 
by permitted pitches. If glottal trill were slow voice analogous to 
the slow movement of a stick on a picket fence, then the highest pitch 
of trillization would be lower than the lowest pitch of voice. This 
is not the case. The pitches of both types can be controlled separately; 
trillization can be given in both low and high pitches, in approximately 
the same range as voice (possibly voice tends to be a trifle higher 
than the analogous position for trillization) ; one can sing up and 
down the scale in either. If one starts with a glottal trill, the addi- 
tion of more tension and a more powerful air stream does not raise 
the pitch of the trill to turn it into voice, but tends simply to shut 
off the glottis completely and stop all sound; a glottal trill which 
starts with very strong lung pressure and tension must have definite 
relaxation (rather than further tension) of the vocal cords before 
they can produce voice. 

If the two formations were identical except for the speed of the 
vocal-cord movement, they could not be combined into a new com- 
posite type. Laryngealization may conveniently be said to be trilliza- 
tion with superimposed voice. Knowledge of the precise method of 
production must await instrumental analysis. In these sounds the 
separate percussions of the glottal trill are audible while at the same 
time regular voice vibrations add their characteristic sound. The 
combination can be sung up and down the scale. In English one 
often hears laryngealized vowels. If a person speaks in a low tone 
of voice, and the voicing tends to be obscured or turned into a “ rum- 
ble ” or “ growl,” two stages of this change can be heard. The first 
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is that in which voice is preserved but separate percussions from 
glottal trill begin to be heard more or less dimly, or even strongly. In 
the second stage the normal voice vibrations are completely eliminated, 
and only trillization remains. The sequence of vocoids and contoids 
remains the same, since vocoidal quality is more the product of the 
oral resonators than of the vocal-cord movement. In this respect 
trillized or laryngealized speech parallels whispered speech except 
that a different type of substitution is made for voice in the vocoids 
and voiced contoids. 

A certain type of vibration of the vocal cords is known as jalsetto 
(i.e. false voice). In this formation a certain “ set ” is given to the 
glottis which may be carried through other sound types — not just 
the vibratory trill. This characteristic is not subject to auditory ar- 
ticulation analysis, but certainly includes some type of reduced aper- 
ture and consequent diminished air stream ; when sibilants are uttered 
with the vocal cords in position for falsetto, they are reduced in audi- 
bility just as they are when trillized. Passing from voice to false 
voice, and vice versa, provides the basis for yodcling. Most women 
seem incapable of using false voice ; the author has recently observed 
a girl of fifteen months use a type of yodeling, while crying, where 
high screams appeared to be in false voice and the “ break ” to voice 
seemed to be clearly audible. Some women can “ squeal ” or scream 
in false voice. 

False whisper is a whisper produced when the cords are in posi- 
tion for false voice but are not vibrating. Such a whisper is quite 
sharp and high-pitched, but not so high or sharp as a whisper which 
has voice position of the vocal cords and simultaneous faucal con- 
striction. False trillization is a glottal trill produced from falsetto 
position. False laryngealization is a combination of false trillization 
and false voice. False voice, false laryngealization, and false trilliza- 
tion can all be sung up and down the scale with the same pitch and 
qualitative characteristics that falsetto has in relation to voice on the 
one hand and that trillization and laryngealization have to voice on 
the other. 

Fortis articulation entails strong, tense movements within the 
types of articulation already described but relative to a norm assumed 
for all sounds; this norm cannot be delineated, but is a convenient 
fiction as a basis for comparison. Weak articulation is lenis. Fortis 
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movement of an initiator tends to make relatively loud sounds, and 
brings acoustic judgments to bear on the fortis nature of sounds. To 
a large extent in general practice the term “ fortis ” is a term of con- 
textual function, indicating that one sound is louder than another 
contiguous to it. In order to separate usage in regard to articulatory, 
acoustic, and contextual data, this study will use “ fortis ” and “ lenis ” 
of articulation only; “ loud ” and “ soft ” of acoustic judgments as 
regards an assumed norm of loudness; “ contextually loud ” to show 
acoustic judgments of the relation of one sound to another in context. 
Thus the hissing of English [s] is loud, but not fortis, in comparison 
with [z] ; the lingual strictures seem to be the same (hence neither 
fortis nor lenis), but the noise of the hissing of the [s] is louder be- 
cause it has a stronger air stream at the lingual stricture (the partial 
glottal closure for voicing reduces the strength of the air stream 
for [z] ). 

Spasmodic articulation is a type produced by sudden movements 
beyond the control of the individual. Sounds employing this articu- 
latory device include the cough, sneeze, hiccough, and frequently 
laughter, or belching. Even if the phonetician can voluntarily simu- 
late these sounds, they are apt to fall short of the genuine spasmodic 
production. Trills have an automatic repetitive element, but their 
initiation is not spasmodic. 


Stricture 

FUNCTION OF STRICTURES 

In Chapter IV, differences were noted in the action of various 
kinds of strictures and of the various cavities. They have distinct 
places in the economy of sound production. In this section a working 
statement is given to facilitate description of these differences. 

When the closure (or, more rarely, the near closure) of a passage- 
way allows that moving section of the wall of the air chamber to be 
an initiator, the closure is an initiating stricture. The closed glottis 
is an initiating stricture during implosives and glottalized sounds. 
Velar closure is of the initiating type during clicks and egressive clicks. 

When two cavities provide potential egress to an air stream, but 
one of the passageways is closed by a complete stricture, so that the 
air is shunted out the other cavity, that closure is a valvate stricture 
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or valvate articulator . Such closures act like valves ; in a pipe organ 
the bellows would correspond to an initiator, and the mouth of a 
pipe to an articulator; but a valve which opens to allow air to reach 
a pipe while not altering the sound as such would be roughly analogous 
to a valvate stricture. In all oral sounds with lung air the closed 
velic prevents air escaping through the nose; only this valvate stric- 
ture makes [s], [f], [z], [i] into purely oral sounds. During na- 
salized oral sounds there is no valvate stricture, and therefore air 
escapes through both the nose and the mouth. For the production 
of nasals a valvate stricture must be present in the oral cavity; the 
choice of the point at which this closure occurs makes the difference 
between [m] and [n] . 

The closure of the entrance into the esophagus is related to valvate 
strictures, but differs in several respects. Its normal position is that 
of closure, with no compressed or rarefied air stream in the esophageal 
cavity. Further, even were the entrance to the cavity open, no egress 
would be provided for the air stream. Closures which shut off such an 
egressless cavity are subvalvate. During (voiceless) pharynx-air 
sounds the closed glottis serves simultaneously in initiating and sub- 
valvate functions. During belches which have no voicing from a 
simultaneous stream of lung air the glottal closure is subvalvate but 
not initiating. 

In reference to the other cavities the oral cavity may be classed 
as primary, the nasal one as secondary, and the pharyngeal cavity as 
tertiary. The pulmonic and esophageal cavities need not be given 
rank in this system since their function is restricted to that of initia- 
tors except for sounds which under normal conditions are below the 
perceptual level (e.g. the sound of air within the alveoli of the lungs; 
medical men use a stethoscope to listen to these sounds). 

Various reasons can be presented for the convenience of this 
ranking. A great variety of strictures can be made in the oral cavity 
since the tongue is extremely flexible and can make contact at many 
different points; in the nasal cavity only a few different strictures 
can be made (at the velic; the nostrils have little or no audible effect 
on sound when they are moved), and the kinesthetic sense of their 
presence is weak; strictures in the pharynx tend to group themselves 
into a very few kinesthetically recognized types or under imitation 
labels because of the lack of such sensation. The individual oral 
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variations therefore tend to become prominent, while the others are 
readily described as modifications of these detailed types. Oral sounds 
can be produced by oral mechanisms acting by themselves, but all 
pharyngeal sounds are in some way modified by the oral or nasal 
cavities (the continuants, by the air passing through such cavities; the 
voiced stops, by induction mechanisms) . Oral quality is more basic 
than pharyngeal quality in that vowel type or timbre persists despite 
various glottalic or pharyngeal formations ; were pharyngeal and oral 
strictures treated as of equal function, voiceless and voiced [u] would 
have to be placed in basically different categories instead of being 
considered varieties of the same vocoid, with its form modified by 
voicing. Strictures of equal type, degree, and function, except for 
their placement in different cavities, can in this way be separated ac- 
cording to their decidedly different usages (e.g. fricative partial 
strictures at the lips, at the velic, and at the glottis do not affect total 
sound production in analogous ways; this contrast is due to differ- 
ence in cavity function). During a [u], friction at the glottis would 
leave the sound a vocoid, but labial friction would change it into a 
contoid. 

Primary strictures include all articulating strictures within the 
oral cavity. In an alveolar click stop, for example, the velar closure 
is not primary, since it is an initiator, not an articulator ; the alveolar 
closure is primary. 

Primary strictures do not all function alike and need to be divided 
into several groups. A primary valvate stricture is an oral valvate 
closure during nasal sounds (e.g. the alveolar closure for [n] ) . 

When two or more oral strictures are present which are neither 
valvate nor initiating, there may still be differences of function be- 
tween them. In a labialized sibilant the labial stricture is subordinate 
to the alveolar one; in a palatalized labial stop the palatal stricture 
is subordinate to the labial one. When two oral strictures occur, the 
one which has the closest position, which approaches nearest the state 
of closure, is primary and the other subprimary ; three degrees of 
stricture are usually sufficient to classify this differentiation: (1) a 
closure; (2) a stricture producing audible local friction; (3) a stric- 
ture with inaudible friction or cavity friction only. Thus the labializa- 
tion of [s] was subprimary, since it was frictionless as contrasted with 
the primary fricative stricture at the alveolar arch ; the palatalization 
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of [p] was subprimary whereas the labial closure was primary, since 
the first had frictionless position whereas the second had closure. 

If within the oral cavity two strictures occur which are neither 
valvate nor initiating and which are of the same degree of closure, 
they cause a double primary stricture (or a double subprimary stric- 
ture if some other noninitiating stricture is of a higher degree of closure 
in the cavity) . The most common type of double primary stricture 
is that of a high but rounded vocoid (e.g. [u], which has frictionless 
labial and velar strictures). One can also make a fricative double 
primary stricture with fricative strictures at the same velar and labial 
points of articulation; in this case the velar stricture has to be of a 
close variety or else its sound will be masked out and made inaudible 
by the vibrations at the lips. Lung-air stops with closures at the 
velum and the lips (plus a valvate closure at the velic) might be de- 
scribed as having double primary strictures, but if the front closure 
releases first it becomes a percussor for an oral static mechanism which 
is present simultaneously with a pulmonic air mechanism whose pri- 
mary stricture is the velar closure. 

Secondary strictures are produced in the nasal cavity. A partial 
velic stricture which gives harsh friction at that point is a secondary 
stricture, as is the cavity friction of nasal resonance during nasalized 
sounds. 

Cavity differences separate the secondary function of velic closure 
from that of primary oral closure in stops like [o], [b], fk], even 
in sequences such as [mbm], [ntn]. In fact, it is convenient upon 
the basis of this difference in stops, and the valvate function of velic 
closure in oral spirants, to state that a velic closure is always valvate. 
Descriptively, then, for stops the velic “ first ” closes off the nasal 
cavity, and “ then ” the oral closure arrests the air stream; the actual 
order does not always conform to this in groups like [mbm]. Our 
normal writing of stops in such a sequence by means of a letter em- 
phasizing the position of the oral closure ( [p] in contrast to [t], bi- 
labial in contrast to alveolar articulation) reflects the basic assump- 
tion of this functional difference, with the oral stricture primary. 

Undesirable alternatives to this descriptive order in stops are to 
consider the velic and oral closures of equal rank (in which case 
there would be no explanation for their functional differences), or 
the approaching closure primary (which would force the interpreta- 
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tion of the second segment in [nt] as a “ velic stop ”), or the release 
primary (which would force the first segment in [dn] to be a “ velic 
stop,” and would prevent the analysis of unreleased final stops). If 
either the approach or the release is to be considered essential to the 
analysis of a stop, then one of the goals laid down for this investigation 
could never be attained — a system which allows the description of 
a phonetic fraction at any point in a sequence — since under these 
conditions the center of a stop could never be analyzed by itself. 

Basic classification of nasals ( [m] , [n] , and the like) is usually 
done by means of the position of their oral closures. This differentia- 
tion is emphasized by the primary position of their valvate strictures, 
while their nature in its broadest features is determined by the cavity 
friction of the passageway in the direct line of the air stream, that is, 
the nasal cavity (cf. escape cavity, p. 140). 

Tertiary strictures are those within the pharyngeal cavity. They 
include all types of glottal modification and strictures made by the 
wall of the pharyngeal cavity, the epiglottis, faucal pillars, and so on. 

The only strictures which are pertinent to the description of any 
phonetic fraction are those which exist within the active air chamber 
for that sound. A glottal stop, for example, has one tertiary stricture, 
the glottal closure, but has no secondary or primary strictures. The 
stop [k] has a secondary velic valvate stricture, a subvalvate esophag- 
eal stricture (which can usually be assumed to exist without specific 
mention of it), and a primary velar closure; labialization would not 
be subprimary to the sound since the lips are not within the active 
chamber (which includes only the pharyngeal and pulmonic cavities 
and that section of the oral cavity which is behind the velar closure) . 

The acme stricture of any sound is the highest ranking one within 
the active chamber. With glottal stop the acme stricture is the ter- 
tiary glottal closure, since neither secondary nor primary strictures are 
present within the active chamber (which includes only the pulmonic 
cavity and that lower edge of the pharyngeal cavity which the glottis 
itself constitutes). With [k] the acme stricture is the primary velar 
closure. With labialized [s] the primary alveolar stricture, not the 
subprimary labial one, is the acme stricture. The acme stricture, 
therefore, is the articulator with the nearest approach to closure within 
the highest ranking cavity of the active air chamber. 

All noninitiating, nonvalvate, nonacme strictures are qualifiers, 
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unless, as in nasals, the acme stricture is itself a valvate closure. No 
stricture may be considered to qualify a valvate closure even if it 
possesses higher rank; nasal resonance in nasals does not merely 
qualify the oral closure but, rather, is basically and essentially char- 
acteristic of the resultant sound type. Qualifiers modify the effect 
produced by the acme stricture itself. Thus labialization may qualify 
an [s], voicing may qualify a labial stop, nasalization may qualify a 
vocoid. In the descriptive order one first locates the active air cham- 
ber and then labels the initiating, valvate, and acme strictures (al- 
though the last two may sometimes coincide; e.g. in [n] the oral clo- 
sure is the acme stricture, but valvate) ; all other strictures (except 
for the secondary cavity in nasals) are qualifiers, whether tertiary, 
secondary, or subprimary. 

Whenever any sound is qualified by cavity friction from a stricture 
of lower rank than the acme one, whether or not the stricture is in the 
same cavity (e.g. subprimary), it has been modified. Such modifica- 
tion by the lips is labialization; by the middle of the tongue ap- 
proximating the palate, palatalization ; by the back of the tongue at 
the velum, velarization; these are subprimary types of qualification. 
Nasalization causes the secondary modification of sounds by adding 
nasal . resonance to them. Tertiary modification includes qualifica- 
tion induced by the approximation of the faucal pillars (and other 
phenomena in the pharynx not subject to observation in a hand mir- 
ror), jaucalization, or by the approximation of the wall of the pharynx 
to the root of the tongue, pharyngealization. 

The frictionless iteratives may also be modifiers. The initial 
stages of a glottalized bilabial stop, for example, may have the modifi- 
cation of an alveolar trill. The most frequent modification of related 
type is glottal vibratory trill applied to any lung-air sound (except 
glottal stop) to make it voiced. Trillization and laryngealization 
may likewise modify the same types of sounds, but with a different 
resultant acoustic quality. 

Whenever the acme stricture of any continuant or stop is qualified 
by a stricture of lower rank which gives local friction (see pp. 71-72, 
138-39), the sound has been frictionalized. Thus nasals, frictionless 
laterals, or vocoids may be frictionalized by glottal stricture during 
whisper, or by any other local friction in the pharyngeal cavity. 

It becomes difficult to frictionalize sounds with outer primary local 
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friction since the inner friction tends to be masked out by the outer 
type; simultaneous inner and outer friction can be produced (e.g. 
glottal friction as for a whisper during [s] ), but in order to be heard 
the inner stricture must usually be closer than would be necessary 
if it were qualifying a vocoid. 

Stops may be temporarily frictionalized by the movement of the 
capped air stream through some partial stricture before the final 
equalization of pressure. A bilabial voiceless stop, for example, may 
have at its commencement the weakly added sound of air pushing past 
a close partial stricture of the tongue at the velum. Stops may also 
be frictionalized by whisper, and so on. 

The fricative iteratives may qualify sounds. Vocoids which are 
frictionalized by a fricative glottal vibratory trill are timbres of 
“ voiced [h] .” 

The trills can themselves be qualified. An alveolar trill, for 
example, may have velar friction added. The trills can also be modi- 
fied by labialization and similar processes. The most common qualifi- 
cation of a trill, as well as of any other sound type, is by a tertiary 
vibratory trill, voice. Voice can be added even to a triple primary 
trill wherein the lips, tongue tip, and uvula are all trilling simultane- 
ously. 

When strictures are present during some sound but lie in a passive 
chamber rather than the active one they are passive primary, passive 
secondary, or passive tertiary types. In [k] any labialization is pas- 
sive primary, as are palatalization and the like. In describing se- 
quences, rather than individual fractions, it is convenient to call pas- 
sive strictures semipassive if they are due to become active in the 
following segment. All semipassive or passive strictures are of in- 
ferior rank to strictures within the active air chamber. In the se- 
quence [gwa], for example, the labialization of the stop in anticipa- 
tion of the following release to [w] is semipassive primary and of 
lower rank for the active pulmonic-pharyngeal air chamber than is 
the tertiary stricture at the glottis which produces the voicing. 

When two or more mechanisms are used simultaneously to form 
a sound, each mechanism may have its own acme stricture. In a 
voiced alveolar click the acme stricture for the lung air is the inner 
side of the velar closure, while the acme stricture for the mouth air 
is the alveolar closure. In [g] the closure at the velum is the acme 
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stricture; since the vibration is carried to some slight extent through 
that obstruction, the oral cavity becomes an induction mechanism, 
and, if there is rounding, the lips may act as an active primary stric- 
ture for that mechanism. This effect of the labialization is above the 
threshold of perception only in extraordinary circumstances (e.g. 
when the lips are rapidly opened and then overrounded in repeated 
succession, and even here the audibility is very slight indeed), so that 
in general this induction mechanism is not a part of a perceptual unit 
and can be ignored. 

When a single closure has two or more simultaneous functions in 
which the inner side of the closure causes the strictural function for 
one mechanism and the outer side does so for the second, it becomes 
convenient to discuss the closure as a single stricture with double 
function rather than to describe it as articulating simultaneously at 
separate and unrelated inner and outer points. In the voiced nasaliza- 
tion of an alveolar click the acme stricture for lung air is the velar 
closure; the same closure functions as the initiator for the simultane- 
ous mouth air. 

During the strictures for most continuants only a part of the poten- 
tial opening is used. For bilabial fricatives the sides of the lips tend to 
be closed; during vocoids the side of the tongue often makes some 
contact; with whisper the anterior part of the glottis may be closed. 
Contacts produced in this way for parts of potential openings are 
adjuncts to primary, secondary, or tertiary strictures. Adjuncts are 
not valvate, since they do not completely prevent passage of air 
through a cavity; they restrict the size of an opening by their contact, 
without completely closing off its aperture. 

Side adjuncts such as the contact at the edges of the lips for [w] 
produce central continuants. Center adjuncts produce lateral con- 
tinuants such as [ 1 ] . Adjuncts may occur on both sides of a passage 
or on a single side only; they may have contact on one side and the 
center of a passage for unilateral sounds. 

Strictures function in two general ways. One type initiates air 
streams and controls the cavities through which the air stream finds 
egress. The other type interferes with the air stream in such a way 
as to cause vibrations. Functions of the second type of stricture 
differ principally ( 1 ) according to the cavity in which they occur and 
( 2 ) according to the degree of stricture. 
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DEGREES OF STRICTURE 

In Chapter V the problem of consonant-vowel division was dis- 
cussed. Strictures do not all interfere to the same extent with the 
transit of air through a passageway. Differences of acoustic result 
in certain respects demand acoustic criteria, as was shown in that 
chapter. Further, as was there demonstrated, criteria for vowel-con- 
sonant delineation are so varied that the division of sounds into those 
two groups has not been satisfactory. The present section of Chap- 
ter VII attempts to carry the classification of sounds a step further 
by utilizing both acoustic criteria and criteria of degrees of stricture. 
Resultant from this procedure is a classification of sounds which, 
among other things, gives the author’s answer to the vowel-consonant 
problem. 

In relative terms not objectively fixed the extent of the region 
of stricture (from another point of view the size of the articulator) 
from front to back may be extensive or restricted; [J] has a more 
extensive stricture (a longer groove) than does [s] . From side to 
side the opening may be narrow (limited by adjuncts) or wide; 2 the 
stricture for [0] is wider than for [u] . From top to bottom the open- 
ing may be close or open; [i] is closer than [e] . 

Any of these contrasted groupings can be subdivided into various 
arbitrary numbers of relative degrees. For close versus open stricture 
in vocoids investigators often find it convenient to use six: high, mid, 
low; close and open high, close and open mid, close and open low. 
Jones’s “ cardinal vowel ” scheme 2 3 is perhaps the most convenient 
and objective classification of this type. 

The shapes of articulators may be described by using these terms, 
A grooved tongue, for example, is narrow and open with extensive 
region of stricture; the deeper the groove, the opener the stricture. 

A stricture may be retained for varying lengths of time: the dura- 
tion may be long or short (or of other lengths described with refer- 
ence to some relative, convenient standard). In pitch, especially 

2 This departs from more general usage which makes “ narrow 7 ” and “ dose,” 
wide ” and “ open,” more or less synonymous. Since two sets of contrasts need 

to be represented it seemed better to specialize the first terms for slightly different 
meanings. 

3 Jones, An Outline of English Phonetics 4 , §§ 131-47. 
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in relation to a vibratory trill of the vocal cords, sounds may be 
high, low, or mid. 

All of these differences can be measured by instruments in terms 
of millimeters, seconds, vibrations per second, and so on. If points of 
reference are established by such a measurement, then it may be 
stated that strictures are objectively extensive, narrow, close, long, 
high, and so on, in respect to these standards. Apart from such a 
standard, these terms are objectively vague since they are then relative 
to a standard which is little more than the subjective impression of 
a particular observer for the median usage of a particular individual. 
Apart from such an objective standard the terms readily become influ- 
enced by contextual factors. 

Strictures may be complete or partial. The first type completely 
closes a passageway, so that no air can pass through it during the 
maintenance of the stricture. Partial strictures diminish the size of 
a passageway, but not so much that air cannot pass through. The 
oral closures in stops and nasals are complete strictures; the oral 
strictures in all fricatives or vowels are partial. 

Partial strictures may be of a local or cavity type (see pp. 70-72) . 
In voiced sounds local strictures produce a friction noise which can 
be heard above the voicing, but the cavity strictures of voiced sounds 
affect the resonance and quality of the sounds without giving audible 
friction at the specific points of their strictures. Thus the criterion for 
the difference is an acoustic one — presence or absence of localized 
friction in voiced sounds. In voiceless sounds the difference is still 
present, but is not so marked, and the border line is not so clear-cut ; 
localized friction caused by air passing over a sharp edge or through 
a small hole contrasts with cavity friction caused by the voiceless air 
stream passing through a cavity (whose shape is determined in part 
by the stricture in question) and inciting the natural resonance of the 
cavity. Local friction tends to be nonperiodic, or noisy; cavity fric- 
tion (especially in voiced sounds) tends to be periodic, or musical. 
Samples of local stricture are glottal position for whisper, the alveolar 
articulation in [s], [z], and the labiodental articulation in [f] ; 
cavity stricture includes labial position for [u], velic or nasal posi- 
tion for [m], glottal position for voiceless vowel. 

When no obvious change of the position of some part of the wall 
of a cavity makes a partial stricture (e.g. when in the oral cavity 
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the tongue is in flat position of rest rather than humped for a high 
vocoid and the lips are open and relatively flat rather than rounded), 
the cavity as a whole functions as a cavity stricture. The whole nasal 
cavity acts in this way for nasalization of other sounds. When a par- 
tial stricture, such as high tongue position, is of the cavity type, it 
combines with the unchanged portion of the cavity wall so that to- 
gether they act as a single unit in cavity friction. 

Local friction versus cavity friction represents an acoustic differ- 
ence with an acoustic criterion to draw the line of demarcation at 
the friction point between them. The identical articulatory mecha- 
nism with the identical strictures may produce in certain cases either 
cavity friction or local friction if the pressure of the air stream varies 
because of changes of the pressure of the initiator. The sound [I] 
with normal pressure of the air stream has only cavity friction. If 
the sound is made very loud, with extra lung pressure, local friction 
may be heard at the high front part of the tongue. If the lung pres- 
sure is very weak, no sound whatsoever will be heard. When all es- 
sential articulatory movements and all requisite strictures are present 
to produce a certain sound, but the pressure of the initiator is so weak 
that neither cavity nor local friction is above the threshold of per- 
ception, zero sounds of various types result (e.g. most frictionless 
continuants with pharynx air) . Assuming a norm of pressure, a sound 
which would under that norm give cavity friction becomes an ac- 
centuated friction sound when local friction is heard because of supra- 
normal pressure. A localized fricative which loses that friction and 
is heard as a sound with cavity friction because of subnormal air- 
stream movement is debilitated sound. For example, the sibilant [z] 
usually becomes a debilitated vocoid if trillization is substituted for 
voice, since the air stream which is allowed to pass through the trilling 
glottis is insufficient to produce audible local friction at the alveolar 
arch. 

An air stream as a whole may be interrupted in ways analogous 
to that in which a single passageway is blocked. Stops are sounds such 
as [p] , [t], [d], in which the air stream is completely interrupted, 
or capped. Continuants are sounds in which the air stream has some 
egress, as in [s], [f], [m], [0], through some partial stricture (or 
open cavity functioning as a cavity stricture) . Stops may require one 
closure in order to interrupt the air stream (e.g. glottal stop or pharyn- 
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geal stop), two closures (all oral stops with lung air require valvate 
velic closure also), or three closures (oral stops with pharynx air add 
initiating glottal closure; double stops have two oral closures and 
velic closure, with lung air). A few stops use four closures (double 
stops with pharynx air; clicks with simultaneous glottalized or im- 
plosive velar stop with velic release). In addition to the closures 
enumerated, these sounds have a subvalvate esophageal closure ; only 
in glottal stop, clicks, and sounds of some minor mechanisms is it 
not theoretically essential, but even there it is present except in the 
belch sounds. 

Continuants are best subdivided in two ways. The two classifica- 
tions used together are much more convenient than either one by itself. 

1 . When local friction versus cavity friction is the basic criterion, 
all continuants with local friction at any point are jrictionals (e.g. 
[is], [f], whispered vocoids, fricative [1], “voiced [h] ”), while all 
continuants with no local friction (e.g. [m], [o], [h], frictionless 
[1] ) are nonjrictionals (or frictionless sounds, or sounds with cavity 
friction only). 

2. The second classification of continuants utilizes a more com- 
plicated set of differentiating criteria. The first dichotomy of the 
second classification is made by a choice of the highest ranking escape 
cavity, that is, the cavity of highest rank through which the air stream 
is escaping or entering the vocal apparatus to or from the atmosphere 
(cf. pp. 133, 141-44). Orals include all continuants in which the air 
stream passes through the primary (i.e. oral) cavity (e.g. [f], fo], 
whispered [o] ) ; orals include, also, sounds in which the air is passing 
at the same time through the secondary (i.e. nasal) cavity (e.g. na- 
salized [a]). Nasals comprise all sounds wherein the air escape is 
limited to the nasal cavity (e.g. [m], and voiceless [m] with strong 
local velic friction as in the rough clearing of the nose), that is, where 
there is a primary valvate stricture. 

There can be no parallel third group of continuants with the 
pharynx as the highest-ranking escape cavity, since in every continu- 
ant in which the air passes through the pharynx it also passes through 
either the nasal or the oral cavity, both of which outrank the pharynx. 
If the air stream passes through the pharynx but is capped by closures 
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at the velic and in the oral cavity, the sound is no longer a continuant 
but a stop (e.g. [b], [k]). 

The next subdivision of this second classification of continuants 
is made by the criterion of the presence or absence of local friction 
in the highest-ranking escape cavity. In this classification, as is not 
true of the one which divides continuants into frictionals and non- 
frictionals, local friction or the lack of it is completely disregarded 
when it occurs in any other than the highest-ranking escape cavity. 
Orals with local friction in the oral cavity are fricative orals (e.g. [f] , 
[s] ) ; when without oral local friction, orals are resonant orals (e.g. 
[0], [ 1 ], whispered [a], timbres of “voiced [h] ”). Nasals with 
local friction at the velic are fricative nasals (very rare, limited al- 
most to types of rough clearing of the nose) ; when without velic 
local friction, nasals are resonant nasals (e.g. [m], [n]). For con- 
venience, however, resonant nasals are called simply “ nasals ” else- 
where in the study unless this type of context necessitates the use 
of the adjectives “ fricative ” and “ resonant ” for clarity. The 
fricative nasals are very rare even as nonspeech sounds. 

A third criterion divides orals, but not nasals. Some orals have 
central escape of the air in respect to the tongue or lips, whereas 
others have a lateral air escape. Children frequently amuse them- 
selves by making linguolabial lateral resonant orals in which the 
tongue tip is placed between the lips and moved from side to side; as 
the tongue tip passes the median point in the lips the air stream is 
momentarily but completely capped. The orals comprise central 
fricative orals (e.g. [s] ) ; lateral fricative orals (e.g. fricative [1] ) ; 
lateral resonant orals (e.g. frictionless [1] ) ; and central resonant 
orals (e.g. [o], [h]). 

A diagram for comparison of the two classifications is given in 
Charts 1-2. 

Since in the second classification the criterion of local friction is 
applied only to phenomena in the highest-ranking escape cavity and 
no reference is made to phenomena in the others, it must be carefully 
noted that resonant orals or resonant nasals which are without local 
friction in their escape cavities may yet be frictionals because of 
local friction elsewhere. When these same sounds were described from 
the viewpoint of the function of their strictures (see p. 134), they 
were said to be frictionalized vocoids, and the like, because of the 
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Chart i 
Continuants 

Frictionals Nonfrictionals 


Chart 2 
Continuants 
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subordinate function of the frictional strictures; frictionalized vo- 
coids are simultaneously frictionals and resonant orals. Frictionalized 
resonant orals include the following: whispered vocoids and whis- 
pered frictionless laterals ; vocoids and frictionless laterals with voiced 
glottal local friction; vocoids and frictionless laterals with voiced or 
voiceless pharyngeal local friction; vocoids and frictionless laterals 
with local velic friction; and so on. Frictionalized nasals include 
nasals with any type of pharyngeal local friction. 

Timbres of [h] (i.e. voiceless vocoids) and the average voiced 
vocoid (i.e. the ordinary voiced “ vowel ”) are both nonfrictionals, 
and also are both resonant orals. Whispered vocoids (i.e. the average 
whispered “ vowel,” voiceless vocoids with added glottal local fric- 
tion) and timbres of “ voiced [h] ” (i.e. voiced vocoids with glottal 
local friction) are frictionals, but also are resonant orals. If one 
represents voiced vocoids by V, and voiceless whispered ones by W, 
convenient proportions may be given (cf. p. 72) to show these re- 
lationships: 

[h] : W:: V: [fi] or V: [h] :: [fi] : W 

The degree of openness of strictures in sounds partly determines 
the frequency with which the segments appear as syllabics (cf. pp. 
117-18). Instead of dividing sounds into stops and continuants, 
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therefore, one may divide them into types which are most frequently 
syllabic, less frequently syllabic, and rarely or never syllabic. The 
contents of the groups might be determined by an elaborate frequency 
count in languages over the world, but the three most basic groups are 
readily identifiable without this procedure. 

The phonetic description of these groups has to be made in terms 
of the same segment characteristics which have already formed the 
basis of previous classifications. The first classification of continu- 
ants, above, into frictionals and nonfrictionals, gives results which 
cannot correlate syllable features neatly, since syllabic action is con- 
ditioned not only by the degrees of stricture, but also by the rank 
of the cavities in which are present the different strictures. Precisely 
for this reason subdivisions of that classification do not serve a suf- 
ficiently useful purpose to be included here, and therefore have been 
omitted. 

The second classification into continuants, however, takes account 
of the rank of cavities and can be readily used for neat descriptions 
of the three major groups of sounds with different syllabic frequencies. 

The sounds which as a group function most frequently as syllables 
are vocoids. Phonetically they comprise the central resonant orals as 
already defined. Vocoids include practically all sounds which are 
usually called “vowels” (whether voiced, voiceless, or whispered), 
except that “ fricative vowels ” are excluded, while “ vowel glides ” 
such as [r], [w],and [y] are included. 

All nonvocoids are contoids. This term more closely approximates 
the general term “ consonant ” than any other used by me. Contoids 
include stops, fricative nasals, lateral resonant orals, and central 
fricative orals. 

The resonant nasals and lateral resonant orals comprise a group 
of contoid sounds that are often syllabic, but by no means so often 
as the vocoids. The remaining controls (the stops, fricative orals, 
and fricative nasals) are rarely syllabic. 

The resonants comprise the resonant orals (central and lateral) 
and resonant nasals; they have no local friction in their escape cav- 
ities (possible exceptions may be nasally frictionalized vocoids — 
a sound type exceedingly difficult to make, if not unutterable) . They 
are the sound types most frequently syllabic. The nonresonants 
comprise the fricative orals (central and lateral) and stops; they 



144 


PHONETICS 


impede the air stream sharply, by having local friction in an escape 
cavity, or by interrupting the air stream entirely. They are sound 
types which are rarely syllabic (but see the syllabic [s] in ‘ pst! ’ 
and many other sounds as syllabics when pronounced in isolation). 
The sonorants are nonvocoid resonants and comprise the lateral res- 
onant orals and resonant nasals (e.g. [m], [n],and [1]). The pres- 
ence or absence of local friction in the pharyngeal cavity does not affect 
possible syllabic function, as can be seen by the use of whispered 
speech in which glottally frictionalized voiceless vocoids are the nor- 
mal syllabics. 

Chart 3 shows in diagrammatic form the three groups arranged 
according to syllabic frequency and according to the resonance of 
the escape cavities. 

Chart 3 
( Read down ) 

Sounds 



Sounds 
(Read up) 

The phonetic groups may be realigned and additional nomencla- 
ture given them according to their function in phoneme and syllable 
contexts. Since phoneme structure differs from language to language, 
final labels must be decided upon for the functional groups of each 
language after its analysis by the phonemicist. Usually, however, 
phoneme structure is so closely related to (or conditioned by) syllable 
structure that, for many languages, groupings by syllable-context 
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function is sufficient. The terms used below in regard to syllable 
function remain flexible and subject to change and amplification by 
specific phoneme function (cf. p. 78), but are nevertheless sufficiently 
stable to be of general value and to serve in phonetic, if not always 
in phonemic, descriptions. 

All vocoids are simultaneously vowels at the time that they are 
functioning as syllable crests (syllabics: e.g. the vocoid segments in 
‘ camp,’ ‘ bit,’ ‘ fur ’). All contoids are syllabic contoids when they 
are functioning as syllable crests (e.g. the syllabics in 1 pst! ’ ‘ sh! ’ 
‘ mhm ’ ; the second syllabic in ‘ bottle,’ ‘ button ’ ; the isolated un- 
released sound [b] ). 

All vocoids are nonsyllabic vocoids while functioning as nonsyl- 
labics (e.g. [y] in ‘ young ’; [w] in ! woo [r] in ‘ rich ’). All con- 
toids while functioning as nonsyllabics are consonants (e.g. the first 
sounds in 1 pie,’ ‘ see,’ ‘ life,’ ‘ nose ’). 

It should be noted that the vocoid-contoid division is a dichoto- 
mous one, of all sounds, but that the vowel-consonant division is 
not, since vowels and consonants do not, in the definition of this study, 
include syllabic contoids or nonsyllabic vocoids. A possible alternate 
nomenclature would call all sounds “ vowels ” when syllabic, and all 
sounds “ consonants ” when nonsyllabic; but this would merely dupli- 
cate the terms “ syllabic ” and “ nonsyllabic,” and depart much 
further from current usage of the terms “ vowel ” and “ consonant ” 
than is at all necessary. 

Charts 4 and 5 diagram the relationship between vowels and con- 
sonants. 

Chart 4 
Sounds (all) 

when functioning as syllable crests (syllabics) 

1 

I I 

Syllabic vocoids Syllabic contoids 
(vowels) 


Chart 5 
Sounds (all) 

when functioning as nonsyllabics 


Nonsyllabic vocoids Nonsyllabic contoids 

(consonants) 
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Trills are most conveniently classified as special types of partial 
stricture rather than as closures. Taken as a unit, a trill gives an 
impression of continuity; it allows an escape of air; its automatic repe- 
tition differentiates it from specific repetitions of the closures of stops, 
so that the trill must be analyzed as a unified whole, not a sequence of 
unrelated closures. Now with lung air, no oral stop can exist while 
the velic is open, else air will escape in a continuous stream from the 
nose ; by definition that prevents trills from being classified with stops, 
since they have an air stream that is completely capped. The sound 
made by opening the velic during stop production is a nasal (e.g. [b] 
is changed to [m] in the sequence [bm]). With trills this is not 
true; an alveolar trill, for example, remains that same trill and re- 
tains all its essential characteristics if the velic is open — it simply 
becomes qualified by the addition of nasal timbre, as would any other 
oral continuant, such as [i] or [1] or [z] . 

In vibratory trills the impression of continuity and unity (with 
no acoustic separation of the percussions) is enhanced. With these 
there can be no doubt of their essential classification with the continu- 
ants. To list voice with any deliberate (and essentially slow) non- 
automatic repeated closing and opening of the vocal cords in a series 
of lung-air glottal stops would be unthinkable. 

Like other continuants, trills can be of fricative or frictionless va- 
rieties. The latter type is clear, and “ liquid,” whereas the former is 
very “ breathy.” In the one the release between closures leaves a 
cavity stricture, so that no friction is audible, whereas in the other 
the release is to a position which gives local friction. Both types ap- 
pear in language ; an alveolar frictionless trill is used in Spanish ; ap- 
proximately the same position is employed in Mixteco for a phoneme 
which has a fricative trill as one variant. 

When certain articulatory positions of the vocal organs are taken, 
a shrill penetrating sound results, called a whistle. No articulatory 
criterion can be given by the author to state at exactly what point the 
shifting position of the tongue or lips (or even the pharynx and glottis) 
may change continuants into whistles. In this respect the whistle 
point, like the friction point, must be established by an acoustic 
criterion rather than an articulatory one. When a formation whis- 
tles, it whistles — as anyone knows who after long arduous practice 
finally passes the whistle point in some continuant, only to be per- 
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plexed to determine what change was finally made just as the whistle 
came. 

The average whistle is a kind of double vocoid in which the front 
labial partial stricture is of a special type that gives whistle timbre 
while the lingual stricture controls the pitch. Lingual strictures of at 
least two general types are found, depending on the individual. In 
one type (my own) high pitch is attained by high front tongue po- 
sitions similar to that for [i] , while low pitches have tongue positions 
analogous to that for [u] . In another type the tongue is highly 
grooved ; low pitch is obtained by a backing of the tongue, high pitch 
by its fronting (somewhat analogous to [s] versus [/]). Pitch may 
be controlled in at least a third manner: very sharp retroflexion may 
provide the low tones, whereas the lowering of the tip to more normal 
position raises the pitch. 

Dental whistles use both labial and lingual vocoidal modifications 
for pitch; rounding of the lips gives lower pitches than unrounding. 
The only pharyngeal whistle which the author has heard was made 
by Dr. Fritz Frauchiger; it was produced while the mouth was wide 
open and appeared to come from the throat. It seems possible that 
“ whistle voice ” uses a whistle formation of the vocal cords rather 
than a type of glottal vibratory trill. The author has never heard 
this type of singing. Whistles may be made by pharynx air and mouth 
air as well as by lung air ; in this respect they differ from other vocoids, 
which tend to be of zero type with any mechanism except the pulmonic 
one or except directly after the release of mouth-air or pharynx-air 
stops. 

Whistles may have two distinct tones audible at once. The double 
whistle point can at present be only acoustically established. The 
articulatory formations seem to differ but little from those giving 
single whistles. 

Whistles may combine with many other types of articulation. One 
of the most beautiful, used for imitation of bird calls, combines a 
labial whistle with a (voiceless) uvular trill. Of course whistles can 
be modified by nasalization, voice, and the like. They can be fric- 
tionalized in various ways, or in turn modify oral local friction. 

The strictures that control the egress of an air stream may have 
various degrees of closure which can be described in relation to as- 
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sumed norms, or norms of an articulatory or acoustic nature. Com- 
binations of strictures in the different cavities allow different degrees 
of air escape as they interrupt the air stream in various ways which 
provide characteristic acoustic effects. Classification and designa- 
tion of sound types allow for these characteristic differences in de- 
gree of strictural closure of a passageway or a combination of pas- 
sageways. 



CHAPTER VIII 


CONCLUSIONS 

I N THE preceding chapters of this study various articulatory move- 
ments have been described which cause vocal sounds and modify 
them. The later part of the chapter in hand, instead of discussing 
parts of sounds or attempting to abstract and analyze individually 
their productive movements, will indicate a procedure for describing 
sounds as complete wholes by giving a statement of certain features 
which should be included in each description. Up to this point all 
vocal sounds have been treated without reference to the fact that they 
may or may not be used in linguistic systems. Assuming that a suf- 
ficiently large sampling has been taken of speech sounds to insure 
the inclusion of most of the major types which can be found, a tenta- 
tive statement can now be given of the line of demarcation between 
speech and nonspeech types. • 

Speech Sounds versus Nonspeech Sounds 

A greater difference is seen between speech sounds and nonspeech 
sounds within their usage of productive mechanisms than within their 
controlling mechanisms. Induction and scraping production mech- 
anisms apparently are never used significantly in speech. Percussion 
mechanisms occur in speech only when they function simultaneously 
with air-stream mechanisms either by superimposing their sounds 
upon these mechanisms during the opening and closing of stops or by 
causing oral static percussives during a velar stop which has pulmonic 
or pharyngeal pressure — that is, the first release of double stops. 

Within the air-stream mechanisms there are further differences 
between nonspeech and speech sound types. Minor air mechanisms 
are never used in a system of speech sounds except where belch timbre 
and esophageal air are substituted for voice and lung air in the speech 
of laryngectomized patients. The major air mechanisms are all used 
in speech; ingressive lung air appears to be employed only for vari- 
ants of phonemes in surprise, pain, or occasionally in certain types of 
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rapid speech, and the like ; 1 ingressive pharynx air is almost limited 
to combinations with lung air; egressive mouth air perhaps does not 
occur except in exclamations. 

It seems that only two combinations of major air mechanisms 
occur in speech, the pulmonic mechanism with the pharyngeal one 
and the pulmonic mechanism with the oral mechanism. In these 
combinations the lung air seems always to be egressive and voiced. 

A further limitation appears to exist in regard to the number of 
phonemes which can be produced in uninterrupted sequence by a 
single-direction initiator movement of any one major air mechanism. 
With all of these air mechanisms the number is limited by the length 
of time which the initiator can keep moving in order to expel air. 
Since the lungs as initiator can continue contracting much longer than 
the larynx can continue rising or the tongue fronting, more sounds 
can be produced in a sequence with that mechanism. It appears, how- 
ever, that no more than one phoneme is made by any single-direction 
movement of the pharyngeal and oral initiators. Nasalized, aspirated, 
and voiced clicks may comprise more than single phonemes, but in 
these cases the extra phonemes of nasalized, aspirated, and voiced 
releases are supplied by the pulmonic system. 

The controlling mechanisms of nonspeech sounds are quite similar 
to those of speech sounds. The difference is largely due to the fact 
that extreme departures from a normal position of articulation or 
exaggerated types of movement occur sometimes in nonspeech sounds 
but rarely if ever within phonemic norms. 

Processes of segmentation are the same for speech and nonspeech 
sounds. Both groups use the same articulators. Furthermore, the 
shapes of the articulators are usually of the same types except that 
odd shapes such as that of the tongue twisted lengthwise are not 
known in speech. Points of articulation are similar for the two 
groups except that extreme displacements do not occur in speech (e.g. 
the tongue tip against the uvula). Types of articulation movements 
are likewise the same except that spasmodic forms (e.g. for belches, 
sneezes, hiccoughs, and some kinds of langhter), involuntary chat- 


1 I have heard Dr. Charles Voegelin mention an ingressive vowel, however, 
which is said to occur in Maidu; if it is there the normal form of the phoneme, 
then ingressive lung air is used as a phonemic norm, and this restriction could 
be omitted. 
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ters, and other sorts of involuntary movement do not produce speech 
norms; this group of spasmodically formed sounds includes a con- 
siderable percentage of those which are resultant from momentary 
physiological states. Strictural function can be analyzed according 
to just one set of principles for all types of speech and nonspeech 
sounds. Degrees of stricture fall into the same general classes for 
both groups, and strictures interrupt the air stream in similar ways 
regardless of whether or not the sounds are used in speech. 

A Descriptive Order 

For the systematic description of any sound, the following items 
should be mentioned when their action is above the threshold of per- 
ception : A statement should appear describing the type of mechanism 
by which the sound is made — whether by an air-stream mechanism, 
percussion mechanism, induction mechanism, or scraping mechanism. 
If an air mechanism is basic to the sound, note should be made of 
the position of the initiator and the direction of its movement, the 
presence of any valvate strictures, and the degree to which the air 
stream is interrupted. When above the perceptual threshold, acme, 
primary, subprimary, secondary, tertiary, and adjunct strictures must 
be described with a rough statement of their points of articulation, 
types of articulation, shapes of the articulators, and degrees of closure. 
If two mechanisms combine to make the sound, the features of each 
mechanism must be described separately. The syllable function 
sound should be stated. 

One illustration will be given of this type of description: A pal- 
atalized [b] is made with an air-stream mechanism. The lungs are 
the initiator and have egressive movement. A secondary valvate 
closure is at the velic and a subvalvate closure is at the esophageal 
orifice. The air stream is completely capped, so the sound is a stop. 
The acme stricture is the primary one. The primary stricture has 
labial point of articulation, a normal type of articulating movement, 
and flat shape, with complete closure. The subprimary stricture has 
palatal articulation, a normal type of articulating movement, and 
humped shape, with a modifying cavity-friction degree of closure. The 
tertiary stricture has glottal point of articulation, vibratory trill 
type of articulating movement, flat shape, with a qualifying partial 
degree of cavity-friction closure. The subprimary stricture has a 



1 52 PHONETICS 

subprimary adjunct that has a normal type of complete contact at 
the sides of the tongue. 

Some of the characteristics of the [b] which have been left unde- 
scribed in the preceding paragraph, and which therefore are subsumed 
under this articulatory label as a layer of imitation-label material 
which can only be known by an observer after hearing the sound, are 
the exact point of the palate below which the hump of the tongue 
occurs ; the exact height of the tongue ; the exact shape of the tongue ; 
the position of the tip of the tongue; the position of the root of the 
tongue ; the position of the faucal pillars ; the position of the epiglottis ; 
the position of the false vocal cords ; the position of the back pharyn- 
geal wall; the relative strength of the articulating movements; the 
relative loudness of the sound; the relative duration of the strictures; 
the presence of a passive nasal chamber; the presence of a passive 
esophageal chamber. The sound was made in isolation, not as part 
of a sequence of sounds, and with no release of the oral or velic stric- 
tures. Therefore there could be no mention of its semipassive cham- 
bers or releasing strictures; or of gliding movements connecting it to 
other sounds ; or of anticipatory movements for it or for other sounds ; 
or of the type of segment, segment crest, segment trough, crossing 
glide, bulge of stricture, time bulge, or percussion transition; or of 
its relation to a syllable, syllabic, nonsyllabic, arresting segment, or 
trigger segment (except that the sound is a syllabic and comprises 
a syllable itself) ; or of its relation to a stress group; and so on. 

The fact may be emphasized that no phonetic description, no mat- 
ter how detailed, is complete. The description of sounds in the man- 
ner suggested here includes the major elements which are above the 
perceptual threshold of a particular observer and delineates the basic 
productive and controlling factors of mechanisms, air streams, and 
strictures. The descriptive technic identifies neither the exact acoustic 
nature nor the exact articulatory nature of the sounds, but it does 
serve to give a rough analysis of their basic productive and controlling 
features. 

To facilitate convenience in reference and to approximate tradi- 
tional nomenclature more closely, the description is modified in two 
ways: (i) Every explicit statement is omitted if the item which it 
mentions can be subsumed under some other part of the label. 
“ Voiced,” for example, implies glottal point of articulation, with a 
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type of articulation which constitutes a vibratory trill, and frictionless 
degree of partial closure. “ Nasalized,” when describing clicks, im- 
plies the presence of a second air mechanism which has lung initiator, 
egressive air stream, vibrating vocal cords, and open nasal passage. 
(2) The descriptive order is reversed. Qualifiers are mentioned first, 
beginning with the ones of lowest rank and working progressively 
toward the acme stricture; the acme stricture follows the qualifiers; 
secondary or primary valvate strictures come next, if mentioned at 
all (all sounds are assumed to have a secondary valvate stricture 
unless some term such as “ nasal ” prevents this assumption) ; direc- 
tion of the air stream is then noted (and is taken to be egressive unless 
some factor specifically states or implies otherwise; “click,” for 
example, implies ingressive air stream) ; choice of initiator, or major 
air mechanism, is next in order (voiceless egressive lung air is assumed 
unless some factor implies otherwise) ; the final item states the degree 
of interruption of the air stream and the subclassification of the type 
if the sound is a continuant (if under the primary stricture it does 
not prove convenient to mention the degree of closure, this final item 
will usually imply it for partial strictures). 

A few names may be given to illustrate the results of utilizing this 
convenient abbreviated type of labeling for the description of vocal 
sounds: 


Voiced palatalized bilabial stop 
Voiced alveolar lateral fricative 
Voiced bilabial nasal 
Voiced high front vocoid 
Velar stop 

Alveolar grooved fricative (or alveolar sibilant) 

Nasalized alveolar click stop 

One can apply a symbol to each or to many of the factors listed for 
description. This produces a type of analphabetic notation in terms 
of productive and controlling mechanisms. It adds little to the data 
included in the more common type of abbreviated description just 
given but has some value for calling to one’s attention the many 
assumptions implicit in short labels. Of course, even an analphabetic 
system is limited to noting those elements which are above the per- 
ceptual threshold of the observer; it is suggestive, but by no means ex- 
haustive. 
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FUNCTIONAL ANALPHABETIC SYMBOLISM 


M productive mechanism 

p percussion mechanism 
i induction mechansim 
s scraping mechanism 
a air-stream mechanism 
I initiator 

1 for lung air 
p for pharynx air 
m for mouth air 
mm for a minor mech- 
anism 

e for esophageal air 
D direction of the air stream 
e egressive 
i ingressive 

C controlling mechanism 
V valvate stricture 
v velic stricture 
o oral stricture 
e subvalvate esophageal 
stricture 

g subvalvate glottal stricture 
I degree of air-stream interrup- 
tion 

c complete (stops) 
p partial (continuants) 
f frictional 
v nonfrictional 
o fricative oral 
c central 
1 lateral 
r resonant oral 

c central (vocoid) 

1 lateral 
n nasal 

n resonant nasal 
f fricative nasal 
Rank of stricture 
A acme 
P primary 
Ps subprimary 
S secondary 
T tertiary 
j adjunct 

Features of stricture 
p point of articulation 
1 labial 
d dental 
i interdental 


a alveolar 
p palatal 
s velar 
u uvular 
v velic 
f faucal 
h pharyngeal 
g glottal 
w arytenoid 
e esophageal 
a articulator 
1 lips 
d teeth 
t tongue tip 
c blade of tongue 
m middle or front part of 
tongue 

a mid-back part of 
tongue 

b back part of tongue 
r root of tongue 
s velum 
u uvula 
v velic 

f faucal pillars 
h pharyngeal wall 
i epiglottis 
g vocal cords 
w arytenoid cartilages 
e orifice of the esophagus 
d degrees of articulation 
t in time 
1 long 
n normal 
s short 

e extensive (region: front 
to back) 
r restricted 

n narrow (side to side) 
w wide 
h high 
c close 
o open 
m mid 

c close 
o open 
1 low 
C close 
o open 
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f with local friction 
v with cavity friction 
w with whistle timbre 
t types of articulation 
n normal 
f flap 
i iterative 
c chatter 
t trill 

v vibratory trill 
z laryngealization 
p pitch 
h high 

m medium or 
normal 
1 low 

f false types 
s spasmodic 
r relative strengths 

a of articulating move- 
ments 
f fortis 
n normal 
1 lenis 

s of acoustic impressions 
1 loud 
n normal 
s soft 

s shapes of articulators 
f flat 
g grooved 
r rounded 
c convex 
o contracted 
e expanded 
s straight 


u cupped 
r retroflexed 
h humped 
a retracted 
p protruded 

n with central air escape 
I with lateral air escape 
; adjuncts 
c central 

I unilateral 

II bilateral 

e extensive contact 
s slight contact 
S segmental type 
c crest 
t trough 
g crossing glide 
b bulge of stricture 
t time bulge 
i instrumental 
r real 

p perceptual 
F function phonetically 

S of the segment in the syllable 
v vowel 

s syllabic contoid 
n nonsyllabic vocoid 
c consonant 
t trigger segment 
a arresting segment 
p presyllabic 
0 postsyllabic 

G of a syllable in a stress group 
h heavily stressed 
m moderately stressed 
w weakly stressed 


A few formulas may be given to illustrate the application of this 
analphabetic system. The stops are unreleased segments. Since the 
sounds are all in isolation each one constitutes a syllable. 

[t] : ilfallDcCVvelcAP^aatdtknransfsSiFSs 

[f] : jl/aIlDeCV'velpfocAP/jdaWtlf(nransnifSrpFSs 

[n] : MaIlDcCVoeIpvnnAPj>aat<itUnransnsfSf>vavdtlvtnransssfT/>gagrftlwvtitvra- 
nsnsfSrpFSs 

[ 0 ] : MaIlDeCVveIpvrcAPf>laldtlwmovtnransnsrjllsAPf)sabdtlmctnransnsfTf>gog- 
(/tlwvtitvransnsfSrpFSv 
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Labialized [s] : MallDeCVvelpfocAP/iaacdtleftnransnsgjllePs^laldtlwmovlnrans- 
mr/lisSrpFSs 

Bilabial dick stop: MalmDiCIcAP^laWtUnnimfSiFSs 

Voiced bilabial implosive stop: JMalpDiCVvelcAP^laldtnfnransnsf/MallDeCVv- 
elcAP^laldtntnransnsfT^gagdtnwvJitvzansnsfSrpFSs 

These formulas conclude the monograph. To trace through their 
symbols is to see the unfolding of a phonetic theory which attempts 
to set forth the interrelated phenomena that contribute to the produc- 
tion of vocal sounds. The most significant usage of these sounds is 
as speech signals. Yet the intricacies of speech systems are best 
understood by viewing them in the light of a phonetic but nonlinguistic 
analysis of the most basic of all speech subunits — the segments. Seg- 
ments may be combined into larger units, that is, into phonemes, 
which are the basic speech units that make up the morphemes, words, 
and phrases of grammar. The analysis of phonemes requires a sep- 
arate type of treatment from that presented here. The analytical 
procedures of phonemics have already provided us with the requisite 
technics. Even these practices, however, may well be modified and 
clarified by correlating them with a more rigorous phonetic theory 
than has so far been available. The present study prepares the way, 
therefore, for a reformulation of phonemic statement. 
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INDEX 


Accentuated sounds, 139 
Acme stricture, 133 
in description of sounds, 151, 153 
for two simultaneous mechanisms, 
I3S 

Acoustic criteria, for classification, 69- 
73 

Acoustic technic; see Auditory acoustic 
technic, Instrumental acoustic 
technic 

Active air chambers; see Air chambers 
Adjuncts 
center, 136 

in description of sounds, 151 
side, 136 
symbolized, 155 
Affricates, variants of [ts], 113 
Air chambers (see also Chambers) 
active, 86 
definition of, 86 
passive, 86 
quiescent, 10 
semipassive, 87 

side chamber, diagrammed, 87 
zero type, 94 
Air streams 

capped: 139; in double stops, 103; 
frictionalizing stops, 135 ; in pha- 
ryngeal mechanism, 90-91 ; re- 
leased by percussor, 104 ; in stops, 
88-89 
of clicks, 8 

with combinations of mechanisms, 98 
degree of interruption: in descrip- 
tion of sounds, 151-52; symbol- 
ized, 154 

direction of: assumed, 153; symbol- 
ized, 154 

duration of, 91, 93 

egress of: in continuants, 139; in 
oral mechanism, 94 
egressive norm, 6-9 
escape cavity for, 133 
with glottal trill, 96 


of glottalized stops, 8 

impeded, by nonresonants, 144 

obstruction of, as criterion, 67 

with percussors, 104 

pressure of: influencing friction, 139; 

in syllables, 118 
for trills, 92 

with valvate strictures, 130 
in vocal sounds, 85 
escape of, in vowels, 68 
Air-stream mechanisms, 85-103 

combinations of major types, 94-99 
in description of sounds, 151-54 
limits of, 89 

major: 89-99; i n speech sounds, 
149 

minor: 89, 99-103; in nonspeech 
sounds, 149 

with oral and pharyngeal mecha- 
nisms, 98-99 
potential, 89 
pulmonic, 89-90 

reversed: compressive, 102; dia- 
grammed, 101 
symbolized, 154 

Alphabets; see Analphabetic symbol- 
ism, Transcription 
Alveolar, 123 
Alveolar arch, 122 
Analphabetic symbolism, 153-56 
assumptions shown by, 153 
formulas of, 155-56 
symbols for, 154-55 
value of, 84 
Animal calls, 32 n. 

Approaches 

in analysis of stops, 132-33 
double, 108 

Articulation ( see also Point of articu- 
lation, Strictures) 
degrees of, symbolized, 154-55 
flap type, 124-25 
fortis, 128 

iterative: 125-29; chatters, 125; 
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trills, 125-26; vibratory trills, 
125-28 
lenis, 128 
normal type, 124 
point of, normal, 123 
region of, 122 
shapes of, symbolized, i55 
spasmodic: 129; in speech, 150-51 
types of, in description of sounds, 

151 

Articulators, 120-21 
as acme stricture, 133 
definition of, 120 

diagram of potential articulators, 
120 

displaced, 122 
norms of contact for, 79 
position, normal, 122 
potential, 121 

shapes of: 137; in description of 
sounds, 151 
size of, 137 
in speech, 150 
symbolized, 154 
types of, symbolized, 155 
valvate, 130 

Articulatory criteria for classification, 
67-69 

Articulatory technic, 14-25, 4° ( see 
also Instrumental articulatory 
technic) 
definition, 14 
imitation labels in, 16-24 
perceptual threshold in, 15 
prosodic limitation, 19 
in segmentation, 23-24 
similar sounds by different move- 
ments, 14 
in teaching, 24 
test for, 16-17 

in vowel-consonant classification, 69, 
72-73 

Arytenoid cartilages, 120-21 

Aspiration as fluctuant: 46; segmen- 
tation of, 50, III 

Assumptions 

of air-stream direction, 153 
of degree of closure, 153 
of esophageal closure, 59, 64 
of functional differences: 56-64; 
types of contact, 63 


indicated by analphabetic symbols, 
153 

of normality for speech sounds, 5-9 
of norms of degree of stricture, 147- 
48 

of primary-secondary differences, 
59-61 

of secondary valvate stricture, 153 
in the term “ nasalized,” 153 
in the term “ voiced,” 152-53 
of velic function, S9-60 
of vowel-consonant division, 66 
Auditory acoustic technic, 26-28 
definition, 14 

necessity of standard for imitation 
in, 31 

in prosodic analysis, 27 
Auditory analysis 
essential, 31 

and perceptual thresholds, 31 
Auditory articulation technic; see Ar- 
ticulatory technic 

Bartholomew, 19 
Beach, 13 

speech sound, phone, 44-45 
on tone, 29 
Belches 

by esophagus air, 99 
spasmodic, 129 
subvalvate stricture with, 130 
timbre, 100 
types of, 100 

Bell, 8, 10, 13, 33, 37, 57-58 
Bender, 39 
Bergen, 17 
Bilabial, 123 
Blade of tongue, 120-21 
Bloch, 39, 51, 53, 59-62, 66, 76, 79 
Bloomfield, 6-8, 10, 14, 16, 21-22, 33- 
34, 42, 54, 60-61, 67-68, 70-72, 75, 
77 

continuum, 45 

exactness in transcription, 23 
phonemics in phonetic writing, 46 
phonemics in the laboratory, 30, 47- 
48 

vowel and consonant, phonemic and 
phonetic use of terms, 78 
Brpndal: 42; sound, phoneme, 45 
Bulge of stricture, m-12 
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Cacuminal, 123 
Candle blowing, 34-3 5 
Capped air streams, 88-89, x 39 
Cardinal vowels, 137 
Carmody, 8 

Catford 32, 37, 57, 95; on stops and 
clicks, 12-13 

Cathode ray tubes, pedagogical use of, 
3i 

Cavities (see also Chambers, Eso- 
phageal, Laryngeal, Nasal, Oral, 
and Pharyngeal cavities) 
acme stricture in highest ranking, 
133 

diagram of, 86 
number of, 86 
rank of, 130-31 
strictural function in, 61-62 
Cavity friction; see Friction, cavity 
Central: 122; resonant and nonreso- 
nant, 143-44 
Cerebral, 123 

Chambers (see also Air Chambers, 
Cavities) 

active, in description, 133 
diagram of, 86 
extent of, 9 
function of, 9 
Chatters, 125 
Cheek 

in minor mechanisms, 102 
trill of, 125 
Chest pulses, 26, 116 
Classification criteria 
acoustic, 3, 69-73 
air-stream interruption, 139 
articulator and point of articulation, 
79 

articulatory, 67-69 
border line, speech and nonspeech 
sounds, 36-37 

cavity function in nasals, 79 

of consonant-vowel division, 66-79 

contextual, 73-78 

degree, 137-38 

distribution, 74-77 

frequency of syllabicity, 143-44 

friction, 70-72, 79, 138, 140-41 

movement, 3, 68-69 

narrowing, 67-68 

obstruction of the air stream, 67 


of oral air escape in vowels, 68 
prominence, 54 
quantity, 74 

rank of cavities and strictures, 143- 
44 

rank of escape cavity, 140-41, 143- 
44 

relative force, 73-74 
side opening in laterals, 79 
sonority, 69-70, 72 
stress, 73-74 
syllable, 73 

voluntary production, 36 
whistle point, 146 
Clicks 

air stream of, 7 
Catford on, 13-14 
diagrams of, 94 

fricative, nasalized bilabial, dia- 
grammed, 98 
ingressive air stream, 88 
loud types, 94 
as marginal sounds, 6-8 
nasalized, 57> 97-98 
phonemic releases of, 130 
plus lung air, 97 

stop, bilabial: analphabetic formula 
of, 156; diagrammed, 97 
timbre of, 93 

voiced: 57, 97; acme stricture in, 
135 

Close, 137 

Close transition, 114 
Closures 

as initiating strictures, 129 
oral, rank of, 108 

Compression, for all sound types, 9 
Compressives ; see Egressives 
Concussion; see Percussor 
Consonants (see also Contoids) 
border line with vowels, 66-67 
criteria of, 66-79 
as nonsyllabic contoid, 145 
redefinition of, by phonemicist, 78 
in relation to syllable, 78 
symbolized, 155 

Contact, strictural function of, 63 
Contextual function 
for classification, 73-78 
definition of, 56 
phonemic distribution, 74-78 
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syllabic, 73 , 73 - 76 , 78 

in vowel— consonant classification, 73? 

78 

Contextual standard, 138 
Continuants 
air egress in, 139 
chart of, 142 
frictionals, 140-42 
frictionless, 140-42 
nonfrictionals, 140-42 
side adjunct in, 136 
symbolized, 134 
Continuums 

cutting of: nonsense syllables, 44; 
perceptual limits in, 45-46 ; by 
phonemics, 44 

delineation of segmentation in, aim, 
32 

description of any point, aim, S I_ S 2 
infinite division of, 43 
number of sounds in, 4S-46 
of speech, 42 

Contoids ( see also Consonants) 

approximation of consonants, 90 n. 
criteria, 78 

long, with syllable division, 117 
syllabic: 143; frequency of, 144 
symbolized, 155 
Contracted shape, 12 1 
Convex shape, 121 

Cough: 129; illustrating glottal stop, 33 
Crest 

description of, 107-8 
of speed, 116 
of syllable, 117 
from time bulge, 112 
Criteria for classification; see Classi- 
fication criteria 
Criterion of normality 
egressive air, 6-9 
voicing, 3-6 

Crossing glide, in substitution se- 
quence, 1 14 
Cunningham, 39 
Cupped articulator, 121-22 
Curry, 6, 8 

Davis, 39 
Deaf-mutes, 17 
Debilitated sound, 139 
Degree of stricture, 137-48 


affecting rank, three, 131 
affecting syllabicity, 142-43 
criteria in, 137 
in speech, 151 
symbolized, 154-55 

de Groot: 30; on definitions of speech 
sound, 47 
Dental, 123 

Description ; see Phonetic description 
Descriptive order, 151-36 
in classification, 72-73 
of closures in stops, 132-33 
illustration of, [b], 151-52 
items to be mentioned in, T51 
labeling of structures, 134 
reversed, in abbreviated labels, 153 
Dialect, transcription of, 53 
Distribution 

as criterion of classification, 74-77 
in vowel-consonant division, 74-77 
Doke: 8, 21, 103; analysis of tone, 
28 

Domal, 123 
Double release, 108 
Double stop 

fbgl diagrammed, 105 
with percussor, 105 
Double whistle point, 147 
Duration: 137-38; in syllable pulse, 
118 

Egressive click 
diagram of, 94 
timbre of, 93 

Egressives: 88; initiator in, 88 
Ejective; see Glottalized stops 
Emotions, 28 n. 

Epiglottal, 123 
Epiglottis, 120-21 
references to, 8 
tertiary, 133 
trill, 126 

as vibrator, 100 n. 

Escape cavity, 140-44 
in nasals, 133 
resonance of, 144 
Esophageal, 123 
Esophageal cavity, 86 
Esophageal mechanism, and combina- 
tions, 99-100 

Esophageal speech, teaching of, 24-23 
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Esophagus, 120-21 
closure of, 64 
trill of, 1 25 
Esophagus air, 149 
air chamber for, 99 
sound types with, 100 
used by laryngectomized patients, 
100 

Exactness in transcription, impossibil- 
ity of, 23-24 
Expanded shape, 121 
Experimental phonetics; see Instru- 
mental, Articulatory, and Acoustic 
technics 
Extensive, 137 

Fairbanks, 28 

False vocal cords: 120-21; references 
to, 8 

False voice, 128 
Falsetto, 6, 128 
Faucal, 123 

Faucal pillars: 120-21; tertiary, 133 
Faucalization, 134 
Fincman, 25, 100 
Flap, 124-23 
alveolar, 123 
labioalveolar, 123 
Flat shape, 12 1 
Fletcher, 3, 13, 27, 66 
Fluctuants 

affecting labels, 31 
different from fixed position, 48 
in segmentation, 47 
segmentation of, 52 
types, 46 

Force, as criterion of classification, 73- 
74 

Forchhammer, 9, 33, 66 
F'ortis, 128 
Fricatives, 140-42 
adjuncts in, 136 
bilabial click, diagram of, 94 
bilabial implosive, diagrammed, 92, 
96 

click, bilabial, diagrammed, 98 

distinct from scrapive, 103 

with double primary stricture, 132 

[f], analphabetic formula of, 136 

as fluctuants, 46 

with minor mechanisms, 100-2 


modified by whistles, 147 
nonpulmonic, 10-11 
with oral mechanism, 93 
with oral and pharyngeal mecha- 
nisms, 98-99 
with pharynx air, 91 
with pulmonic mechanism, 90 
[d], in time bulge, 113 
qualifier as, 134-33 
[s] , labialized, analphabetic formula 
of, 156 

[s], syllabic, 144 

velic, glottalized, diagrammed, 92 
voiced, with pharynx air, 93-96 
[X], diagram, 90 
Friction 

air-stream pressure in, 139 
cavity: 71, 138-39; as criterion, 
139-41 ; debilitated sounds with, 
139; in nasals, 132 
glottal, in vocoids and laterals, 142 
in laterals, 72 

local: 71, 138-39; as criterion, 139- 
41 ; in nasals, 132 ; pharyngeal, as 
qualifier, 134 
in nasals, 72 
oral, rank of, 108 
in trills, 146 

velic, in vocoids and laterals, 142 
in vowels, 70-72 
Friction point, 139 

Frictionalized, 134-35, 141-42 (see also 
Whisper, and “ Voiced [h] ”) 
Frictionals: 140-42; and syllabic rela- 
tionships, 143 
Frictionless spirant 

with oral mechanism, 93 
with pharynx air, 91 
with pulmonic mechanism, 90 
Function of strictures; see Strictural 
function 

Gairdner, 3-6, 34, 31, 39, 61-62 
Ganthony, on ventriloquism, 17 
Gesture, 6 
Gingival, 122 
Glides 

audible, in segment, no 
crossing, in, 114 
as fluctuants, 46 
in segmentation, 49-51 
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single-direction, no-n 
two-direction, hi 
G lottal, 123 
Glottal stops, 139-40 
acme stricture of, 133 
as fluctuant, 46 
impossible by pharynx air, 92 
repeated, 146 

in sequential glottalization, 96 
tertiary stricture only, 133 
Glottal trills, 126-27 
different pitches with, 127 
“ Glottalized ” sounds; see Sequential 
glottalization 
Glottalized stops 
air stream of, 8 
Catford on, 12-14 
combined with velic release and 
egressive-click fricative, dia- 
grammed, 99 
in English, 7 n. 
larynx in, 10 

by pharyngeal mechanism, 90-91 
releases of, diagrammed, 92 
velar, diagrammed, 92 
Glottis, during pharynx-air sounds, 95- 
96 

Grammar, basic units of, 156 
Grandgent, 22 
Gray, 39 

Groove: 121, 137; in whistles, 147 

[fi] (.see also “Voiced [h],” Vowels) 
in diagrams, 90 
friction of, 71-72 
relation to vowels, 71-72, 76 
[fl 3 ; see “ Voiced [h] ” 

Haden, 26, 47 
Harris, 27, 34 
Heepe, 60 
Herriott, 26 
Herzog, 28 
Hiccough, 129 
High, 138 
Howell, 39 
Humped shape, 122 

Imitation-label technic: 16-24; ven- 
triloquism in, 17 
Implosives 
air stream, 8 
[6], diagram of, 96 


bilabial implosive fricative, dia- 
grammed: voiced, 96; voiceless, 92 
by pharyngeal mechanism, 91 
stop, nasalized 'bilabial, diagrammed, 
97 

stop, voiced, analphabetic formula 
of, 156 
voiced, 95-96 

voiced stop plus click stop, dia- 
grammed, 99 

Individual characteristics, source of, 
22 n. 

Induced vibrator, 106 
Induction mechanism, 106 
in [g], 135-36 
in nonspeech sounds, 149 
Ingressives: 88 ; initiator in, 88 
Initiator, 87-88 
closure as, 129 
contrast with percussor, 103 
definition of, 87-88 
in detailed description of sounds, 151, 
153-54 

inward and outward, 88 
nonpulmonic, in syllable, 118-19 
pulse in, causing segment division, 
108 

reverse, 10 
in segmentation, 108 
with simultaneous acme function, 
136 

in syllable pulse, 117 
symbolized, 154 

Instrumental acoustic technic ( see also 
Sound waves) 
definition of, 14 
in finding thresholds, 29-30 
phonemics prerequisite, 30, 47-48 
postulation of sounds, 30 
in prosodic analysis, 28 
for segmentation, 30 
threshold limitations of, 31 
unimportant sounds in, 41 
Instrumental articulatory technic: 25- 
26; definition of, 14 
Instruments, measurement of shape, 
duration, pitch, 138 
Interdental, 123 

Iteratives, 125-28 (see also Trills) 
as frictionalizer, 135 
as modifier, 134 
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Jackson, 24, 25, 100 
Jaw, 120-21 

Jespersen, 6, 8, 22, 27, 56, 67, 71, 77, 
79 

Jones, 5-10, 13-15, 20, 22, 25, 31, 33, 
35-36, 46, 50-Si, 54, 58-59, 63 
speech sound, 42-44, 48-49 
value of imitation, 23 
vowel-consonant separation, 66-76 

Karlgren, 9, 34, 51 
Karr, 19-20 

Kenyon, 5-7, 9, 14, 22, 33, 42, 54, 58, 

63 

speech sound, and phoneme, 43-44, 
48-49 

vowel-consonant separation, 66-72, 
74-76 

Kimber, 39 
Kiss 

diagram of, 94 
segmentation of, 108 
Kleinfeld, 39 
Klinghardt, 34 
Kock, 30 

Krapp, 34, 37, 38 
Labels 

for abbreviated description, 153 
for groups of sounds, affected by 

phonemics, 51 
Labialization, 134 
with pharynx air, 92 
in rank, 131-32 
as secondary, 59-61 
as semipassive, 135 
Labiodental, 123 
Laryngeal cavity, 86 
Laryngeal ventricle, reference to, 8 
Laryngealization, 127-28 
false, 128 
as modifier, 134 
Larynx 

raising in singing, 18 
references to, 8 
Laterals, 122 

air stream as criterion, 141 
center adjunct in, 136 
flap, 125 
friction in, 72 


frictionalized, 134 
front contact in, 79 
resonant and nonresonant, 143-44 
strictural function in, 63 
trills, fricative, 125 
whispered, 142 
Laughter, 129 
Lenis, 128 
Levin, 24, 25 
Lip reading, 17 
Lip “ voice ” 

with mouth air, 93 
with pharynx air, 91-92 
as trill, 126 
Lips, 120-21 
as initiator, 88 
in minor mechanisms, 101-2 
trill of, 125 

Local friction; see Friction, local 
Long, 137-38 

Loud, relation to fortis, 129 
Loudness, in syllable, 118 
Low, 138 
Lowie, 34 

Lung air ( see also Pulmonic mecha- 
nism) 

ingressive, 149 
with mouth air, 97-98 
in pulmonic mechanism, 89-90 
symbolized, 154 
Lungs, as initiator, 88 
Luthy, 32 

Marginal sounds, definition of, 5 
Masking, of velar friction, 132 
Mechanisms (tee also Air-stream, 
Controlling, Induction, Minor, 
Oral, Percussion, Productive, Re- 
versed, Scraping, and Static mech- 
anisms) 

combinations of, in speech, 150 
controlling: in speech, 150; symbol- 
ized, 154 

in description of sounds, 151-54 
productive, symbolized, 154 
Metallic sound, 22 n. 

Mid, 138 
Miller, C. B., 6 
Miller, D. C., 6, 37 

Minor air-stream mechanisms, 99-103 
cheek, 102 
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combinations with, 101-3 
limits of sound types, 100 
symbolized, 134 
Modifiers, 134 
Morrison, 25, 100 

Mouth air ( see also Oral air-stream 
mechanisms) 
with lung air, 97-98 
in oral mechanism, 93 
percussors with, 104 
symbolized, 134 
Movement 

as criterion of classification, 68-69 
in stops, 108 
Muckey, 8, 22 
Murmur, 6 

Narrow, 137 
Narrowing 

as criterion of classification, 67-68 
strictural function of, 63 
Nasal cavity 
bounds of, 83 
rank of, 130 
Nasal passage; see Velic 
Nasalization: 134; of clicks, 97- 
98 

“ Nasalized ” stops, as fluctuants, 46 
Nasals 
chart of, 142 

classification by oral closure, 133 
differentiated from vowels and 
stops, 79 
as fluctuants, 46 
fricative, 141-42, 144 
friction in, 72 
frictionalized, 134 
in induction mechanisms, 106 
[nl , analphabetic formula of, 156 
nonpulmonic, 10-n 
with pharyngeal friction, 142 
qualifiers in, 133-34 
relation to stop, 62 
resonant, 141-44 
types of, 140 

valvate stricture with, 130-31 
Nasopharynx, 83 
Negus, 8, 32, 39 
Nicholson, 3, 9, 34, 67, 70 
Noel-Armfield: 3-9, n, 20, 22-23, 23, 


32-36, 42, 49-5°) 54i S7> 59> 6l > 
63, 66-67, 70-72, 73-76; speech 
sound and phoneme, 43 
Nonfrictional: 140-42; and syllabic 
relationship, 143 
Nonresonants, i43~44 
Nonsense syllables, segmentation in, 
52 

Nonspeech sounds, 32-41 
animal calls, 32 n. 
boundary with marginal sounds, 3 
boundary with speech sounds, 14 
different from speech sounds, 149-51 
to illustrate phonetic alphabet, 33 
to illustrate speech sounds, 32 
inverse, 32 

as pedagogical illustrations, 37 
from physiological conditions, 39 
postulation of, 40 
transcription of, 38-39 
for transcription practice, 34 
Nonsyllabics 
contoids, 14s 
definition of, 117 
vocoids, 143 
Nostrils, 130 

Open, 137 

Open transition: in; in crossing glide, 

114 

Oral air-stream mechanism, 93-95 
expansion of, 93 
limits of sounds by, 93 
mouth air in, 93 

plus pharyngeal mechanism, 98-99 
plus pharyngeal and pulmonic mech- 
anisms, 99 

sound types with, 99 
plus pulmonic mechanism, 97-98 
sound types with, 93, 97 
variations of tongue position in, 94 
Oral cavity, 86 
bounds of, 83 
rank of, 130-31 

Oral sounds, valvate stricture with, 
130 

Orals, 140-44 
chart of, 142 

fricative: central, 141-44; lateral, 
141-44 
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nonresonant, 143-44 

resonant: central, 141-44; lateral, 

141-44 

Order ; see Descriptive order 
Palatal, 123 

Palatalization: 134; rank of, 13 1 
Palate, 122 
Palatograms, 23 
Parmenter, 25, 47 

Passive air chamber; see Air chamber, 
passive 

Passive stricture, 13s 
Passy: 3-9, 22, 32-33, 42, 43, 30, 39, 
63, 66, 68, 70; glides, 49 
Pepinsky, 8 
Perceptual thresholds 
in analphabetic symbolism, 153 
in articulatory technic, 13-16 
conditioned by phonemics, 13-16 
in description of sounds, 151-52 
found by instruments, 29-30 
in passive labialization, 136 
of percussors, 103-4 
in segmentation, 108-10 
Percussion 
in classification, 10 
in laryngealization, 127-28 
Percussion mechanisms, 103-3 
combination with other mechanisms, 
105 

in speech, 149 
symbolized, 154 
Percussive transition, 113 
Percussive-transition vocoid 
with mouth air, 93-94 
with pharynx air, 91 
Percussives, 106 

in close transition, 114 
flaps, 123 
Percussor 

causing segmental center, 114 
contrast with initiator, 103 
in double stops, 132 
perceptual threshold of, 103 
types of, 103 
Pharyngeal, 123 
Pharyngeal cavity 
bounds of, 86 
rank of, 130-31 


Pharyngeal mechanism 
description of, 90-92 
glottalized stops by, 91 
glottis during, 95-96 
implosives by, 91 
pharynx air in, 90 
plus oral mechanism, 98—99 
plus oral and pulmonic mechanisms, 
99 

plus pulmonic mechanism, 93-96 
sound types by, 91 
Pharyngeal stops, 139-40 
Pharyngealization, 134 
Pharynx, 120-21 (see also Throat) 
references to, 8 
strictural function in, 62-63 
tertiary, 133 
trill, 126 
Pharynx air 

frictionless continuants with, 139 
glottis during, 92 
ingressive, 150 
percussors with, 104 
in pharyngeal mechanism, 90-91 
sounds diagrammed, 92 
sound types by, 91 
subvalvate stricture with, 130 
symbolized, 134 
whistles by, 147 

Phone ( see also Phonetic unit, Speech 
sound) 

description of, 113-16 
instrumental, 113-16 
perceptual, 113-16 
real, 1 15-16 

Phonemes (see also Distribution) 
abnormal position in, rare, 130 
analysis of, 156 

from combined segments, 114, 136 
definition of, 42 
grouping of releases in, 103 
influencing segmental perception, 
108-9 

number of segments in, 53 
number of, in sequence, limits to, 
150 

in phonetic descriptions, 11-12 
relation to speech sound, 42-44 
segment transcription of, 33 
in singing, 20-21 
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system of, conditioned by syllable, 
144 

variants, with ingressive lung air, 
149 

Phonetic alphabet; see Transcription 
Phonetic description (see also De- 
scriptive Order) 
abbreviated form of, 152-53 
acoustic, 11 
articulatory, 11 

detailed description of [b], 151-52 
incompleteness of, 152 
phonemic, 11-12 
phonemic interference with, 35 
in relation to context, 77 
of sounds in isolation, 77 
strictural function in, 64-65 
Phonetic fraction 
definition of, 52, 116 
in description of stops, 133 
Twaddell on, 50 
Phonetic unit 

description of, 115-16 
identical segments, 52-53 
Phonetics, definitions of, 35 
Phonograph records, as model, 31 
Pike, 28 
Pitch, 137-38 
high, 138 
low, 138 
mid, 138 

Plosives; see Stops 

Point of articulation, 122-24 (see also 
Articulation) 
arbitrary norms for, 124 
infinite division in, 124 
in speech, 150 
symbolized, 154 
Position 

abnormal, rare in phonemes, 150 
fixed, 48-49 
fluid, 48-49 

Postpostsequential, 114 
Postpresequential, 114 
Postsequential, 114 
Potential air-stream mechanisms, 89 
Potential percussion mechanisms, 104 
Prepostsequential, 114 
Presequential, 114 
Pressure, in stops, 108 
Pressure sounds; see Egressives 


Primary cavity, 130-31 ( see also Cav- 
ities, Strictures) 

Primary stricture 

in description of sounds, 151 

double, 132 

oral, 13 1 

passive, 135 

valvate, 131 

Productive mechanisms: 85—106; in 
speech, 149 

Prominence, in syllables, 54 , ii 7 _i 8 

Pronovost, 28 

Proscdy, in articulatory descriptions, 

19 

Protruded, 122 

Puff of breath, illustrating aspiration, 
33 

Pulmonic mechanism: 89-90 
lung air in, 89-90 
plus oral mechanism, 97-98 
plus oral and pharyngeal mechanism, 
sound types with, 99 
plus pharyngeal mechanism, 95-96 
sound types with, 89-90 

Qualifiers: 133-35; in description of 
sounds, 153 

Quantity, as criterion, 74 

Quiescent air chamber, in percussives, 
10 

Rank (see also Strictural function) 
affecting syllabicity, 143 
descending series of, 107-8 
of modification, 134 
primary, three degrees of, 13 1 
of qualifiers, in short labels, 153 
in sounds with two mechanisms, 95 
of stricture: in segmentation, 107-9; 
symbolized, 154 

Rarefaction, for all sound types, 9 

Rarefactives ; see Ingressives 

Release 

affricated: 94; in pharynx air, 92 
in analysis of stops, 132-33 
of capped air stream, by percussor, 
104 

double, 108 
glottal, 92 

percussive-transition vocoid, dia- 
gram of, 92 
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with percussor, 103 
phonemic, of clicks, 150 
time bulge in, 113 
velic, in glottalized stop, 99 
vocoidal, 94 
Resonants, 143-44 
nonvocoid, sonorants, 144 
Restricted articulation, 137 
Retracted, 122 

Retroflexion: 122-23; in whistles, 147 
Reversed mechanism, 88 
compressive, 102 
diagrammed, 101 

Ripman, 5-8, 14, 33, 50, 58-59, 62, 67, 
72 

Root of tongue, 120-21 
Rounding: 121; in whistles, 147 
Rousselot, 6 
Russell, 8, 22, 25 

Sapir: n, 24, 34, 40, 96; candle blow- 
ing versus speech, 35 
Schall, 23 

Scraping mechanism: 105; in non- 
speech sounds, 149 
Scrapives, 105 
Seashore, 6 

Secondary cavity, 130-31 
Secondary strictures 
in description of sounds, 151 
nasal, 132 
passive, 135 
Segmentation, 107-20 
basic to speech units, 46 
of fixed and fluid positions, 51 
of fluctuants, 52 
of glide, 49-si 
instrumental, 47-48 
natural, 47, 52 

nonfictitious, 4, 23-24, 42, 46-47, 52 
of nonsense syllables, 52 
perceptual variation in, 52 
phonetic fraction in, 50 
possibility of, 23-24 
rank of stricture in, 107-8 
in speech, 150 
Segments 

acoustic delimitation of, 30 
and alphabets, 46-47 
arresting, 119 
aspiration, 50 


center of, 107, 114-15 

combined into phonemes, 156 

crest, 107-8 

description of, 107 

division in initiator movement, 108 

double release in, 108 

glides in, 110-11 

instrumental: 109, 116; in close 
transition, 114 
level, no 

natural borders of, 47 
number of, in continuum, 45-46 
perceptibility, factors influencing, 
108 

perceptual, iio, 116 
real, 109, 116 

relation to syllable, 117-18 
as speech subunit, 156 
stop, 50 
trigger, 119 
trough, 107-8 
types of, symbolized, 155 
undelineated borders of, 52 
Semipassive strictures, 135 
Sepulchral voice, 22 n. 

Sequential glottalization 

of continuants, production of, 11 
of spirants, 96 

Shapes of articulators, 121-22, 137 
in speech, 150 
symbolized, 155 
Shohara, 15, 27 
Short, 137-38 

Sibilants: 153; [s], labialized, analpha- 
betic formula of, 156 
Singing 

for illustration, 6 
pharynx in, 18 
and phonemics, 20-21 
raising larynx in, 18 
throat enlargement in, 17, 19-20 
tone placement in, 18-19, 21 
tongue positions in, 18 
training in, 17-21 
vowel formation in, 19-20 
Sneeze, 129 
Snore, 125 

Snow, pitch differences with loudness, 
30 n. 

Soames, 67-68, 70 
Soft, relation to lenis, 129 
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Sonorants: 144; normality of, 8-9 
Sonority 

criterion for classification, 69-70, 7 2 
in syllable, 118 

Sound waves (see also Instrumental 
acoustic technic) 

classification of, pedagogical use of, 

3i 

limitation in study of, 30-31 
production of, 85 

Sounds: 116; processes in production 

of, 83, 85 

Spasmodic articulation: 129; in speech, 
150 

Speech sounds 

cut from continuum, 44 
definition of, 53, 116 
different from nonspeech sounds, 
I49-5I 

differentiated from phoneme, 42-44 
with fixed position, 48-49 
with fluid position, 48-49 
instrumental separation of, 47-48 
Stackpole, 39 
Standard, relative, 138 
Static air, 104 

diagram of a sound with, 105 
with induced vibrator, 106 
in segments, 114 

Static mechanisms, in double stops, 
132 

Stetson, 25-26, 45, 73 
denies continuum divisible into 
sounds, 45 

on the syllable, 53-54 
Stevens, 39 

Stirling, 5, 34, 51, 58, 67 
Stomach cavity, 86 
Stops 

air stream in, 88-89 

[b], detailed description of, 151-52 

[6], diagram of, 96 

basic points of articulation in, 124 

certain labels for, 153 

click, bilabial 

analphabetic formula of, 156 
nasalized, diagrammed, 97 
voiced, diagrammed, 98 
description of, 139 
descriptive order of closures in, 132- 
33 


double: function of strictures in, 60; 

with percussor, 105 ; rank in, 132 
frictionalized, 135 
glottal ; see Glottal stops 
glottalized; see Glottalized stops 
implosive, voiced 

alveolar, diagrammed, 97 
bilabial, analphabetic formula, of, 

156 

interdental twisted, 122 
[k], rank of strictures in, 133 
with minor mechanisms, 100-1 
as nonresonants, 143-44 
number of closures in, 140 
with oral mechanisms: 93; and with 
pharyngeal mechanisms, 98-99 
[p], diagram of, 90 
with percussor, 104 
pharyngeal, 139-40 
with pharynx air, 92 
plus bilabial click stop, diagrammed, 

99 

with pulmonic mechanism, 90 
rarefactive, 89 

releases in: 57; segmentation of, 50 
syllabic, in isolation, 144 
symbolized, 154 

[t], analphabetic formula of, 155 
voiced implosive, 95-96 
voiced, ingressive, 96 
voiceless, as phonetic entity, 108 
Straight articulator, 121 
Stress, as criterion cf classification, 73- 
74 

Stress groups: 119; function in, sym- 
bolized, 155 
Stressed syllables, 119 
Strictural function, 56-65, 129-36 (see 
also Rank) 

air-stream interference in, 136 
of cavities, 9, 61-63, 136 
of contact and narrowing, 63 
contrasting types of, 57-62 
definition of, 56 

descending series of rank; 107-8 
differences of degree of stricture, 
136 

of the esophagus, 59 
initiating stricture, 129-30, 136 
in laterals, 63 
in nasals, 62 
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of pharynx, 62-63 
in phonetic description, 64-65 
of secondary strictures (labializa- 
tion, etc.), 59-61 
in speech, 151 

two functions of one stricture, 57 
valvate, 129-30 

of the velic: 58-60; closure of, al- 
ways valvate, 132 
in vowels, 61-63 

Strictures ( see also Articulation, Crest, 
Degree of stricture, Trough) 
acme, 133 

addition of, in crest, 107 
bulge of, in-12 
complete, 138 
definition of, 120 

degree of: affecting syllabicity, 142- 
43; in description of sounds, 151; 
in speech, 151; in syllable, 118 
double primary, 132 
duration of: long, 137-38; short, 
137-38 

initiating, 129 
number of, in sounds, 64 
partial: cavity, 138-39 ( see also 
Friction, cavity) ; local, 138-39 
( see also Friction, local) 
passive, 135 
primary, oral, 13 1 
primary valvate, 13 1 
ranks of (.see also Strictural func- 
tion) ; in description of sounds, 
151—53 ; primary, three degrees of, 
13 1 ; symbolized, 154 
region of: close, 137; extensive, 137; 
extent of, 137; narrow, 137; open, 
137; restricted, 137; wide, 137 
secondary, nasal, 132 
subprimary, 131-32 
substitution of, 107 
subtraction of, in trough, 107 
symbolized, 154-55 
tertiary, pharyngeal, 133 
valvate: 129-30; in description of 
sounds, 151, 153 
Subjective phenomena, 30 
Subprimary: 131-32; in description of 
sounds, 151 

Substitution sequence, 114 
Subvalvate, 130 


Suction sounds; see Ingressives 
Sweet: 5-8, 10, 14-15, 20, 22, 32-33, 
33, 42, 45, 48-50, 59, 62-63, 67, 
70-72, 79; glides, 49 
Syllabics 

chart of frequency of types, 144 
contoid, 145 
definition of, 117 

groups by frequency: 143-44; de- 
gree of stricture affecting, 142- 
43 

nonresonants, rarely, 144 
resonants, frequently, 144 
vocoid, 145 

Syllables, 116-20 (see also Syllabics, 
Nonsyllabics) 
acoustic criteria for, 54 
affecting phonemic systems, 144 
articulatory criteria for, S3-54 
checked, closed, 119 
crest of, 117 

as criteria of classification, 73, 75- 
76, 78 

definition of, 116 
in description of sounds, 151-52 
effect on alphabets, 76 
free, open, 119 

function: symbolized, 155; labels 
for, phonemic influence in, 145 
instrumental, n6 
nucleus of, 117 
perceptual, 117-18 
produced by nonpulmonic initiators, 
118-19 

prominence in, 117 
real, 116-17 

relation to segments, 117-18 
speed of, 20 

Symbols; see Analphabetic symbolism, 
Transcription 

Teeth, 120-21 
Tertiary cavity, 130-31 
Tertiary stricture 

in description of sounds, 151 
passive, 135 
pharyngeal, 133 

Thresholds; see Perceptual thresholds 
Throat (see also Pharynx) 
in singing, 19-20 
in ventriloquism, 17 
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Timbre 
belch, ioo 
click, 93 
glottalized, 91 
of [h], 142 
implosive, 91 
percussive, 103 
vowel type, 13 1 
whistle, 147 
Time ; see Duration 
Time bulge, 111-13 

of initiator: 117; in syllable, 116 
[ts] in, 113 
Tip, 120-21 
Tone languages 

instruments in analysis of, 28-29 
intervals in, 27-28 
Tone placement, in singing, 18-19, 21 
Tongue, 120-21 
back of, 120-21 
as initiator, 88 
middle of, 120-21 
in oral mechanism, 93-94 
as percussor, 103 
in rank of cavity, 130 
root of, 120-21 
shape of, in whistles, 147 
side of, trilled, 125 
tip of: 120-21; in minor mecha- 
nisms, 101-2 
Tongue tip, trill, 123 
Trager, 39, 5U 59. 60-62, 66, 76, 79 _ 
Transcription ( see also Analphabetic 
symbolism) 

affected by syllable, 76 
of dialects, 53 

exactness in, impossible, 23-24 
for phonemic analysis, S3 
and segments, 46-47 
Transition sound: 113; percussives, 10, 
115 

Travis, 39 
Trevino, 25, 47 
Trillization, 126-27 
false, 128 
as modifier, 134 
Trills, 125-28 

as continuants, '146 
fricative, 146 
frictionalized, 135 


frictionless, 146 

glottal, instead of voicing, 96 

modified, 135 

as modifier, 134 

with pharynx air, 91-92 

points of articulation in, 125-26 

with pulmonic mechanism, 90 

not spasmodic, 129 

triple, 135 

uvular, with whistles, 147 
vibratory: 125-28, 146; laryngeali- 
zation, 127-28; trillization (see 
Trillization) ; voice (see Voice) 
Trough 

description of, 107-8 
between syllables, 116 
Trubetzkoy, 53 
Twaddell, 29, 42, 50 
Twisted shape, 122 

Types of articulation: 124-29; sym- 
bolized, 155 

Units of sound, 42-55. 107-19 (see 
also Phone, Phoneme, Segment, 
Speech sound) 

Uvula: 120-21; trill of, 125 
Uvular, 123 

Valvate strictures or articulators, 129- 
30 

closure, not qualified, 134 
in description of sounds, 151, 153 
primary, in nasals, 13 1 
subvalvate, 130 
symbolized, 154 
Velar, 123 
Velarization, 134 
Velic, 120-21, 123 

closure in stops, rank of, 132-33. 
140 

friction, 142 

function, 58-60 

in induction mechanism, 106 

as percussor, 103 

place of, 85 

trill, 125 

Velum, 120-21; see also Velic 
Ventriloquism, 17 
Vibrato, 6 

Vibratory trills, 146 
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Vietor, 5, 9, 33, 67 
Vocal Cords, 120-21 ( see also Voice) 
as percussor, 103 
photographed, 26 
trill of, 123-26 
Vocoids (see also Vowel) 
adjuncts in, 136 
approximation of vowels, 90 n. 
criteria for, 78 

debilitated, with trillization, 139 
double, whistle, 147 
with double primary stricture, 132 
friction during, at glottis, 13 1 
frictionalized, 134-35, 141-42 
long, with syllable division, 117 
nonsyllabic, 145 

[o], analphabetic formula of, 156 
with oral mechanism, 93 
percussive-transition type, 91-94 
with pharynx air, 91 
pitch by, in whistles, 147 
with pulmonic mechanism, 90 
as syllable crests, 145 
syllabic frequency of, 144 
symbolized, 134-33 
in time bulge, 112 
[u], diagram of, 90 
voiceless: 142; with glottal friction, 
144 

whispered, 140, 142 
zero types, 147 
Voegelin, 34, 74, 130 
Voice (sec also Singing, Voicing) 
compared to trillization, 127 
false, 128 

lip “voice 126; with pharynx air, 
91-92 

slow, not glottal trill, 127 
training methods for, 17-21 
vibratory trill in, 126 
“ whistle voice,” 147 
“Voiced [h],” 140, 142 
description of, 133 
friction of, 71-72 

Voicing, 146 ( see also Vocal cords, 
Voice) 

in belches, 130 
in clicks, 97 
as modifier, 134 
in oral mechanism, 99 


with pharynx air, 93-96, 99 
of sonorants, 8-9 
in vowel definitions, 3, 72, 76 
Von Hornbostel, 28 
Vowels (see also Glides, Vocoids) 
border line with consonants, 66-67 
cardinal, 137 
classification of, 22 
criteria of, 66-79 
friction in t 70-72 
glides, 48-49 
infinite variety of, 15 
ingressive, in Maidu, 130 n. 
movement as criterion for, 68 
nasalized, 61 

need for oral instruction, 23 

redefinition of, by phonemicist, 78 

in relation to syllable, 78 

rounded abnormal, 9 

in singing, 19-20 

strictural function in, 61-62 

as syllabic vocoids, 143 

symbolized, 153 

triangle, 22-23 

unrounded abnormal, 9 

voiceless (see also [h] ) ; friction in, 

138 

voicing in definitions of, 3, 72, 76 
whispered, 71-72 (see also Whisper) 

Ward, 3-6, 8, 10-11, 29, 33, 48, 37, 59- 
60, 66-72 

Westerman: 3-6, 8, 10-n, 29, 33, 37, 
59-60, 66, 70-72 ; phone, 48 
Whisper, 6, 144 
as abnormal, 5 
adjuncts in, 136 
false, 128 
friction in, 138 
friction of, 71-72 
impossible by pharynx air, 92 
lateral, 142 
as qualifier, 134 
substituted for voicing, 128 
in vowel definitions, 3 
Whistle point, 146 
double, 147 
Whistles, 146-47 
dental, 147 
double, 147 
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modified, by voice, nasalization, fric- 
tion, 147 

with oral air-stream mechanism, 93 
with pharynx air, 91-92 
“ Whistle voice,” 147 


Wide, 137 
Woo, 9, 33, Si. 70 
X rays, 25 
Yodeling, 128 
Zero sound, 139 



